2025-05-07T19:42:32.7087905Z Current runner version: '2.323.0' 2025-05-07T19:42:32.7093485Z Runner name: 'i-053f9a24237032a22' 2025-05-07T19:42:32.7094437Z Machine name: 'ip-10-0-33-130' 2025-05-07T19:42:32.7097149Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:32.7099129Z Contents: read 2025-05-07T19:42:32.7099713Z Metadata: read 2025-05-07T19:42:32.7100256Z Packages: read 2025-05-07T19:42:32.7100730Z ##[endgroup] 2025-05-07T19:42:32.7103033Z Secret source: None 2025-05-07T19:42:32.7103905Z Prepare workflow directory 2025-05-07T19:42:32.7713745Z Prepare all required actions 2025-05-07T19:42:32.7751717Z Getting action download info 2025-05-07T19:42:32.9586295Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:33.2267618Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:33.7622039Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.10, 12.6.3, gcc) 2025-05-07T19:42:33.8562941Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:33.8695759Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:33.8706598Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:33.8708086Z ##[endgroup] 2025-05-07T19:42:34.9539966Z Runner Type: linux.24xlarge 2025-05-07T19:42:34.9540494Z Instance Type: c5.24xlarge 2025-05-07T19:42:34.9540819Z AMI Name: unknown 2025-05-07T19:42:34.9573701Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:39.9469553Z ##[group]Checking docker version 2025-05-07T19:42:39.9482978Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:39.9680334Z '1.44' 2025-05-07T19:42:39.9697874Z Docker daemon API version: '1.44' 2025-05-07T19:42:39.9698408Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:39.9897762Z '1.44' 2025-05-07T19:42:39.9907300Z Docker client API 
version: '1.44' 2025-05-07T19:42:39.9911608Z ##[endgroup] 2025-05-07T19:42:39.9914515Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:39.9919167Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=6c9639" 2025-05-07T19:42:40.0073776Z ##[command]/usr/bin/docker network prune --force --filter "label=6c9639" 2025-05-07T19:42:40.0209718Z ##[endgroup] 2025-05-07T19:42:40.0210058Z ##[group]Create local container network 2025-05-07T19:42:40.0219190Z ##[command]/usr/bin/docker network create --label 6c9639 github_network_94fc62e5ee044bf697e58aee19d01a64 2025-05-07T19:42:40.2673006Z 055fdf516889e73725d1222b29b45678df45d65eca75773d4d3ceb95c26e1c02 2025-05-07T19:42:40.2687950Z ##[endgroup] 2025-05-07T19:42:40.2710126Z ##[group]Starting job container 2025-05-07T19:42:40.2730003Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:40.4758915Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:40.5468618Z 1c3112c87ab2: Pulling fs layer 2025-05-07T19:42:41.1069551Z 1c3112c87ab2: Verifying Checksum 2025-05-07T19:42:42.8400476Z 1c3112c87ab2: Download complete 2025-05-07T19:42:42.8401558Z 1c3112c87ab2: Pull complete 2025-05-07T19:42:42.8565763Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:42.8616228Z Status: Downloaded newer image for amazonlinux:2023 2025-05-07T19:42:42.8645070Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:42.8741859Z ##[command]/usr/bin/docker create --name 0d427c0d6c6f41979bf3159e20b740ae_amazonlinux2023_c08d76 --label 6c9639 --workdir /__w/FBGEMM/FBGEMM --network github_network_94fc62e5ee044bf697e58aee19d01a64 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v 
"/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:43.2024950Z 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T19:42:43.2050605Z ##[command]/usr/bin/docker start 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T19:42:43.6842946Z 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T19:42:43.6864949Z ##[command]/usr/bin/docker ps --all --filter id=12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:43.7023960Z 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf Up Less than a second 2025-05-07T19:42:43.7042772Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T19:42:43.7193513Z HOME=/github/home 2025-05-07T19:42:43.7194261Z GITHUB_ACTIONS=true 2025-05-07T19:42:43.7194580Z CI=true 2025-05-07T19:42:43.7195003Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:43.7213109Z ##[endgroup] 2025-05-07T19:42:43.7223804Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:43.7225711Z ##[endgroup] 2025-05-07T19:42:43.7311737Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:43.7312654Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:43.7313773Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:43.7314161Z env: 2025-05-07T19:42:43.7314482Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:43.7314845Z BUILD_ENV: 
build_binary 2025-05-07T19:42:43.7315208Z BUILD_TARGET: default 2025-05-07T19:42:43.7315596Z BUILD_VARIANT: cuda 2025-05-07T19:42:43.7316116Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:43.7316457Z ##[endgroup] 2025-05-07T19:42:44.5989788Z Amazon Linux 2023 repository 66 MB/s | 37 MB 00:00 2025-05-07T19:42:51.1710405Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:44 2025. 2025-05-07T19:42:51.7265497Z Dependencies resolved. 2025-05-07T19:42:51.7439855Z Nothing to do. 2025-05-07T19:42:51.7441440Z Complete! 2025-05-07T19:42:51.9897663Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:44 2025. 2025-05-07T19:42:52.0525504Z Dependencies resolved. 2025-05-07T19:42:52.0752910Z ======================================================================================== 2025-05-07T19:42:52.0754899Z Package Arch Version Repository Size 2025-05-07T19:42:52.0756978Z ======================================================================================== 2025-05-07T19:42:52.0758229Z Installing: 2025-05-07T19:42:52.0759501Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:52.0761172Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:52.0762836Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:52.0763435Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:52.0763949Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:52.0764555Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:52.0765060Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:52.0765681Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:52.0766214Z Installing dependencies: 2025-05-07T19:42:52.0766626Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:52.0767200Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:52.0768251Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:52.0768889Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:52.0769456Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:52.0891606Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:52.0892239Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:52.0892748Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:52.0893290Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:52.0893904Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:52.0894488Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:52.0895083Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:52.0896093Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:52.0896640Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:52.0897176Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:52.0897703Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:52.0898259Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:52.0898796Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:52.0899488Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:52.0900104Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:52.0900654Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:52.0901225Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:52.0901737Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:52.0902422Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:52.0902956Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:52.0903468Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:52.0904013Z openssh x86_64 
8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:52.0904651Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:52.0905214Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:52.0905751Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:52.0906328Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:52.0906916Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:52.0907460Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:52.0908124Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:52.0908722Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:52.0909323Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:52.0910043Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:52.0910620Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:52.0911214Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:52.0911766Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:52.0912350Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:52.0912958Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:52.0913523Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:52.0914125Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:52.0914726Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:52.0915326Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:52.0916035Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:52.0916730Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:52.0917318Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 
2025-05-07T19:42:52.0917934Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:52.0918533Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:52.0919116Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:52.0919685Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:52.0920252Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:52.0920815Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:52.0921395Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:52.0921987Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:52.0922540Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:52.0923109Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:52.0923715Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:52.0924317Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:52.0924931Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:52.0925518Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:52.0926148Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:52.0926782Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:52.0927358Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:52.0927923Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:52.0928478Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:52.0929210Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:52.0929793Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:52.0930373Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 
2025-05-07T19:42:52.0931057Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:52.0931676Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:52.0932290Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:52.0932837Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:52.0933381Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:52.0933943Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:52.0934475Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:52.0935028Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:52.0935557Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:52.0936092Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:52.0936613Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:52.0937228Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:52.0937776Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:52.0938361Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:52.0938922Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:52.0939489Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:52.0940033Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:52.0940578Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:52.0941101Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:52.0941650Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:52.0942182Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:52.0942708Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:52.0943155Z Installing 
weak dependencies: 2025-05-07T19:42:52.0943597Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:52.0944202Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:52.0944795Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:52.0945371Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:52.0945940Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:52.0946687Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:52.0947251Z 2025-05-07T19:42:52.0947357Z Transaction Summary 2025-05-07T19:42:52.0947640Z ======================================================================================== 2025-05-07T19:42:52.0947994Z Install 107 Packages 2025-05-07T19:42:52.0948152Z 2025-05-07T19:42:52.0948328Z Total download size: 38 M 2025-05-07T19:42:52.0948591Z Installed size: 151 M 2025-05-07T19:42:52.0948861Z Downloading Packages: 2025-05-07T19:42:52.3845801Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.6 MB/s | 82 kB 00:00 2025-05-07T19:42:52.3951594Z (2/107): elfutils-debuginfod-client-0.188-3.amz 6.4 MB/s | 41 kB 00:00 2025-05-07T19:42:52.4194358Z (3/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 92 MB/s | 5.3 MB 00:00 2025-05-07T19:42:52.4241328Z (4/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 12 MB/s | 786 kB 00:00 2025-05-07T19:42:52.4293302Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 16 MB/s | 539 kB 00:00 2025-05-07T19:42:52.4320599Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 8.3 MB/s | 54 kB 00:00 2025-05-07T19:42:52.4516932Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 62 MB/s | 1.1 MB 00:00 2025-05-07T19:42:52.4682873Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 71 MB/s | 2.8 MB 00:00 2025-05-07T19:42:52.4916180Z (9/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 
72 MB/s | 4.7 MB 00:00 2025-05-07T19:42:52.4972356Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 25 MB/s | 1.0 MB 00:00 2025-05-07T19:42:52.5001902Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.2 MB/s | 160 kB 00:00 2025-05-07T19:42:52.5045932Z (12/107): jansson-2.14-0.amzn2023.x86_64.rpm 6.8 MB/s | 46 kB 00:00 2025-05-07T19:42:52.5079869Z (13/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 8.3 MB/s | 62 kB 00:00 2025-05-07T19:42:52.5167100Z (14/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 85 MB/s | 1.6 MB 00:00 2025-05-07T19:42:52.5194089Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 13 MB/s | 168 kB 00:00 2025-05-07T19:42:52.5204719Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 4.8 MB/s | 57 kB 00:00 2025-05-07T19:42:52.5275809Z (17/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 4.2 MB/s | 28 kB 00:00 2025-05-07T19:42:52.5330190Z (18/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 48 MB/s | 756 kB 00:00 2025-05-07T19:42:52.5349188Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 7.7 MB/s | 108 kB 00:00 2025-05-07T19:42:52.5390050Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 14 MB/s | 153 kB 00:00 2025-05-07T19:42:52.5426598Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 14 MB/s | 95 kB 00:00 2025-05-07T19:42:52.5441026Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.8 MB/s | 31 kB 00:00 2025-05-07T19:42:52.5481877Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 12 MB/s | 106 kB 00:00 2025-05-07T19:42:52.5513858Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 14 MB/s | 121 kB 00:00 2025-05-07T19:42:52.5520781Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 3.4 MB/s | 26 kB 00:00 2025-05-07T19:42:52.5602775Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 60 MB/s | 706 kB 00:00 2025-05-07T19:42:52.5627979Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.0 MB/s | 10 kB 00:00 2025-05-07T19:42:52.5664162Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 28 MB/s | 394 kB 00:00 2025-05-07T19:42:52.5716164Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 55 MB/s | 573 
kB 00:00 2025-05-07T19:42:52.5795701Z (30/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 38 MB/s | 454 kB 00:00 2025-05-07T19:42:52.5832828Z (31/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 16 MB/s | 256 kB 00:00 2025-05-07T19:42:52.5885252Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 41 MB/s | 708 kB 00:00 2025-05-07T19:42:52.5938812Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 40 MB/s | 542 kB 00:00 2025-05-07T19:42:52.5954332Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 7.9 MB/s | 93 kB 00:00 2025-05-07T19:42:52.5974489Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.3 MB/s | 41 kB 00:00 2025-05-07T19:42:52.6031718Z (36/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 25 MB/s | 179 kB 00:00 2025-05-07T19:42:52.6048642Z (37/107): perl-AutoLoader-5.74-477.amzn2023.0.6 2.4 MB/s | 22 kB 00:00 2025-05-07T19:42:52.6065489Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 3.3 MB/s | 29 kB 00:00 2025-05-07T19:42:52.6086962Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 4.2 MB/s | 22 kB 00:00 2025-05-07T19:42:52.6126766Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 9.3 MB/s | 55 kB 00:00 2025-05-07T19:42:52.6148012Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 3.3 MB/s | 26 kB 00:00 2025-05-07T19:42:52.6160145Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.2 MB/s | 36 kB 00:00 2025-05-07T19:42:52.6182497Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 5.1 MB/s | 26 kB 00:00 2025-05-07T19:42:52.6224568Z (44/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 2.8 MB/s | 15 kB 00:00 2025-05-07T19:42:52.6332801Z (45/107): perl-Encode-3.15-462.amzn2023.0.2.x86 102 MB/s | 1.7 MB 00:00 2025-05-07T19:42:52.6348827Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.5 MB/s | 41 kB 00:00 2025-05-07T19:42:52.6359672Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 2.7 MB/s | 31 kB 00:00 2025-05-07T19:42:52.6387002Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 4.3 MB/s | 21 kB 00:00 2025-05-07T19:42:52.6425476Z (49/107): perl-File-Basename-2.85-477.amzn2023. 
3.1 MB/s | 18 kB 00:00 2025-05-07T19:42:52.6440333Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 3.5 MB/s | 26 kB 00:00 2025-05-07T19:42:52.6455788Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 5.2 MB/s | 36 kB 00:00 2025-05-07T19:42:52.6486077Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 10 MB/s | 60 kB 00:00 2025-05-07T19:42:52.6515516Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 3.3 MB/s | 17 kB 00:00 2025-05-07T19:42:52.6533395Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.3 MB/s | 16 kB 00:00 2025-05-07T19:42:52.6551469Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 9.3 MB/s | 60 kB 00:00 2025-05-07T19:42:52.6567196Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 3.2 MB/s | 16 kB 00:00 2025-05-07T19:42:52.6590650Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 8.4 MB/s | 42 kB 00:00 2025-05-07T19:42:52.6612637Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 9.9 MB/s | 56 kB 00:00 2025-05-07T19:42:52.6634511Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 15 MB/s | 87 kB 00:00 2025-05-07T19:42:52.6650505Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 7.4 MB/s | 42 kB 00:00 2025-05-07T19:42:52.6685209Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 30 MB/s | 218 kB 00:00 2025-05-07T19:42:52.6706103Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.5 MB/s | 23 kB 00:00 2025-05-07T19:42:52.6717279Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.9 MB/s | 31 kB 00:00 2025-05-07T19:42:52.6740448Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.6 MB/s | 13 kB 00:00 2025-05-07T19:42:52.6777650Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 4.2 MB/s | 23 kB 00:00 2025-05-07T19:42:52.6823073Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 40 MB/s | 392 kB 00:00 2025-05-07T19:42:52.6841429Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 9.5 MB/s | 97 kB 00:00 2025-05-07T19:42:52.6869049Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 11 MB/s | 85 kB 00:00 2025-05-07T19:42:52.6893556Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 
4.9 MB/s | 20 kB 00:00 2025-05-07T19:42:52.6938941Z (70/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 32 MB/s | 215 kB 00:00 2025-05-07T19:42:52.6959856Z (71/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 7.5 MB/s | 84 kB 00:00 2025-05-07T19:42:52.6981138Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.7 MB/s | 41 kB 00:00 2025-05-07T19:42:52.7000371Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 12 MB/s | 71 kB 00:00 2025-05-07T19:42:52.7022516Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.3 MB/s | 12 kB 00:00 2025-05-07T19:42:52.7039206Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 10 MB/s | 55 kB 00:00 2025-05-07T19:42:52.7073713Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 14 MB/s | 96 kB 00:00 2025-05-07T19:42:52.7085301Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.4 MB/s | 15 kB 00:00 2025-05-07T19:42:52.7109661Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 7.5 MB/s | 48 kB 00:00 2025-05-07T19:42:52.7132129Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:52.7148228Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 6.3 MB/s | 36 kB 00:00 2025-05-07T19:42:52.7162014Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 3.4 MB/s | 17 kB 00:00 2025-05-07T19:42:52.7185853Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 4.7 MB/s | 22 kB 00:00 2025-05-07T19:42:52.7202176Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 6.6 MB/s | 34 kB 00:00 2025-05-07T19:42:52.7232265Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 18 MB/s | 108 kB 00:00 2025-05-07T19:42:52.7248544Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.8 MB/s | 17 kB 00:00 2025-05-07T19:42:52.7286835Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 2.9 MB/s | 23 kB 00:00 2025-05-07T19:42:52.7296659Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.3 MB/s | 14 kB 00:00 2025-05-07T19:42:52.7320173Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
11 MB/s | 71 kB 00:00 2025-05-07T19:42:52.7352130Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 3.0 MB/s | 15 kB 00:00 2025-05-07T19:42:52.7385330Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 16 MB/s | 126 kB 00:00 2025-05-07T19:42:52.7539357Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 96 MB/s | 2.0 MB 00:00 2025-05-07T19:42:52.7552443Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.4 MB/s | 29 kB 00:00 2025-05-07T19:42:52.7565662Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.7 MB/s | 46 kB 00:00 2025-05-07T19:42:52.7606758Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.0 MB/s | 13 kB 00:00 2025-05-07T19:42:52.7623382Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 2.9 MB/s | 14 kB 00:00 2025-05-07T19:42:52.7642494Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 16 MB/s | 112 kB 00:00 2025-05-07T19:42:52.7654996Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 2.6 MB/s | 12 kB 00:00 2025-05-07T19:42:52.7684369Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.1 MB/s | 13 kB 00:00 2025-05-07T19:42:52.7822289Z (99/107): sudo-python-plugin-1.9.15-1.p5.amzn20 4.2 MB/s | 56 kB 00:00 2025-05-07T19:42:52.7898682Z (100/107): shadow-utils-4.9-12.amzn2023.0.4.x86 45 MB/s | 1.1 MB 00:00 2025-05-07T19:42:52.7961678Z (101/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 42 MB/s | 1.3 MB 00:00 2025-05-07T19:42:52.8017647Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 33 MB/s | 613 kB 00:00 2025-05-07T19:42:52.8222307Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 35 MB/s | 879 kB 00:00 2025-05-07T19:42:52.8344683Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 59 MB/s | 2.2 MB 00:00 2025-05-07T19:42:52.8383809Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 
12 MB/s | 432 kB 00:00 2025-05-07T19:42:52.8441734Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 36 MB/s | 779 kB 00:00 2025-05-07T19:42:52.8456611Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 6.7 MB/s | 42 kB 00:00 2025-05-07T19:42:52.8481441Z -------------------------------------------------------------------------------- 2025-05-07T19:42:52.8482373Z Total 49 MB/s | 38 MB 00:00 2025-05-07T19:42:53.8997515Z Running transaction check 2025-05-07T19:42:53.9469034Z Transaction check succeeded. 2025-05-07T19:42:53.9469365Z Running transaction test 2025-05-07T19:42:54.3151895Z Transaction test succeeded. 2025-05-07T19:42:54.3152287Z Running transaction 2025-05-07T19:42:55.0038685Z Preparing : 1/1 2025-05-07T19:42:55.0186646Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:55.0421296Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:55.0620578Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:55.0666715Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:55.0739178Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:55.0834810Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:55.1102706Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:55.1162341Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:55.1215866Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:55.1708614Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:55.1773037Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:55.2052002Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:55.2102301Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:55.2153241Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:55.2206540Z Installing : 
libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:55.2257987Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:55.2395476Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:55.2438409Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:55.2485228Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:55.2546619Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:55.2598513Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:55.2637141Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:55.3052858Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:55.3122198Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:55.3256100Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:55.3682596Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:55.3852880Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:55.4644525Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:55.4645192Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:55.4645682Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:55.4645977Z 2025-05-07T19:42:55.4840905Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:55.5101179Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:55.5285044Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:55.5343431Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:55.6458177Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:55.7963301Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:55.8081880Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 
2025-05-07T19:42:55.8500526Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.8579624Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.8655904Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.8735528Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:55.8822836Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:55.8879110Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:55.8923597Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:55.8980971Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:55.9069914Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:55.9141841Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:55.9236537Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:55.9448386Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:55.9539785Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:55.9594762Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:55.9639114Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:55.9697093Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:55.9753359Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:55.9812718Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:55.9903971Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:55.9965745Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:56.0015914Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:56.0075480Z Installing : 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:56.0135226Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:56.0190402Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:56.0232574Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:56.0285069Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:56.0354023Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:56.0413980Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:56.0518120Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:56.0599394Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:56.0660019Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:56.0710208Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:56.0753192Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:56.0838718Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:56.0941283Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:56.1017763Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:56.1076078Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:56.1138723Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:56.1212031Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:56.1274546Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:56.1334673Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:56.1399754Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:56.1450099Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 
2025-05-07T19:42:56.1504296Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:56.1561923Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:56.1639671Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:56.1722750Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:56.1797207Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:56.1858111Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:56.1910149Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:56.1954394Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:56.2019226Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:56.2073959Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:56.2127628Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:56.2187909Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:56.2245079Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:56.2325000Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:56.2864984Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:56.3822079Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:56.3952812Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:56.4038181Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:56.4112700Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:56.4184664Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:56.4254091Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:56.4309861Z Installing : 
perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:56.4377354Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:56.4448771Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:56.4656118Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:56.4791786Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:56.4873059Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:56.5274896Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:56.6501107Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:56.6597697Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:56.6717095Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:56.7017563Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:56.7116945Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:56.7363652Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:56.7583160Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:56.7667632Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.7789395Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:57.5402509Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:57.5403180Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:57.5403952Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:57.5404583Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:57.5405291Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:42:57.5405981Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:57.5406582Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:57.5407222Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:57.5407781Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:57.5408770Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:57.5409410Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:57.5410014Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:57.5410661Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:57.5411240Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:57.5411891Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:57.5412531Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:57.5413106Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:57.5413730Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:57.5414323Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:57.5414959Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:57.5415604Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:57.5416197Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:57.5416833Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:57.5417417Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:57.5418113Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:57.5418766Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:57.5419321Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:57.5420007Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:57.5420606Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:57.5421353Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:57.5422065Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:57.5422822Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:57.5423578Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:57.5424313Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:57.5425184Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:57.5426315Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:57.5427148Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:57.5427912Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:57.5428620Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:57.5429387Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:57.5430245Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:57.5431134Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:57.5432009Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:57.5432936Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:57.5433771Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:57.5434548Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:57.5435620Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:57.5436530Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:57.5437158Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:57.5437699Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:57.5438264Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:57.5438802Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:57.5439355Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:57.5439916Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:57.5440450Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:57.5440997Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:57.5441534Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:57.5442109Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:57.5442647Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:57.5443158Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:57.5443688Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:57.5444223Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:57.5444772Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:57.5445305Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:57.5445854Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:57.5446599Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:57.5447133Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:57.5447676Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:57.5448207Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:57.5448757Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:57.5449312Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:57.5449845Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:42:57.5450578Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:57.5451101Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:57.5451665Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:57.5452195Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:57.5452728Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:57.5453270Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:57.5453814Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:57.5454373Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:57.5454912Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:57.5455480Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:57.5456046Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:57.5456576Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:57.5457197Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:57.5457711Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:57.5458247Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:57.5458770Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:57.5459307Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:57.5459843Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:57.5460357Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:57.5460897Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:57.5461395Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:57.5461935Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:57.5462476Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:57.5463025Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:57.5463559Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:57.5464076Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:57.5464609Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:57.5465118Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:57.5465635Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:57.5466174Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:57.5466714Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:57.5467308Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:57.5467811Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:57.5468342Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:57.5468846Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:57.6479639Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:57.6480032Z 2025-05-07T19:42:57.6480119Z Installed: 2025-05-07T19:42:57.6480464Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:57.6481221Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6481765Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:57.6482351Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6482930Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6483430Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6483930Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6484459Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:57.6484971Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:42:57.6485524Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6486050Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:57.6486594Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:57.6487275Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:57.6487810Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:57.6488368Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6488889Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6489443Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6489973Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:57.6490558Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6491153Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6491786Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6492344Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6492957Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6493524Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6494118Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6494680Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:57.6495227Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:57.6495940Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6496469Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:57.6497029Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:57.6497684Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:57.6498220Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:57.6498755Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6499236Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6499778Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6500318Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6500867Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6501402Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6501989Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6502565Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6503098Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:57.6503646Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6504173Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6504723Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6505254Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6505764Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:57.6506315Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:57.6506843Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6507399Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6508022Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6508554Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6509090Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6509588Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6510119Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6510631Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6511157Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6511668Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:57.6512164Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:57.6512663Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6513156Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:57.6513686Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:57.6514215Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6514721Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6515254Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:57.6515781Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6516552Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:57.6517178Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6517740Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6518311Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6518866Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:57.6519428Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6519958Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6520512Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6521075Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6521633Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6522953Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6523494Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6524079Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:57.6524646Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6525213Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6525820Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6526421Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:57.6527020Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:57.6527571Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:57.6528142Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6528788Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:57.6529333Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6529995Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6530703Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6531456Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:57.6532216Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6533022Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:57.6533890Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6534675Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6535277Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6535798Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:57.6536366Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6536882Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:57.6537419Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6537940Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:57.6538455Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:57.6539009Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:57.6539501Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6540018Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6540546Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6541087Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:57.6541599Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:57.6541892Z 2025-05-07T19:42:57.6541989Z Complete! 
2025-05-07T19:42:57.7313718Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:57.7314032Z with: 2025-05-07T19:42:57.7314242Z submodules: true 2025-05-07T19:42:57.7314466Z repository: pytorch/FBGEMM 2025-05-07T19:42:57.7314896Z token: *** 2025-05-07T19:42:57.7315091Z ssh-strict: true 2025-05-07T19:42:57.7315309Z ssh-user: git 2025-05-07T19:42:57.7315520Z persist-credentials: true 2025-05-07T19:42:57.7315778Z clean: true 2025-05-07T19:42:57.7316107Z sparse-checkout-cone-mode: true 2025-05-07T19:42:57.7316725Z fetch-depth: 1 2025-05-07T19:42:57.7316959Z fetch-tags: false 2025-05-07T19:42:57.7317244Z show-progress: true 2025-05-07T19:42:57.7317485Z lfs: false 2025-05-07T19:42:57.7317700Z set-safe-directory: true 2025-05-07T19:42:57.7317962Z env: 2025-05-07T19:42:57.7318178Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:57.7318498Z BUILD_ENV: build_binary 2025-05-07T19:42:57.7318744Z BUILD_TARGET: default 2025-05-07T19:42:57.7318992Z BUILD_VARIANT: cuda 2025-05-07T19:42:57.7319280Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:57.7319529Z ##[endgroup] 2025-05-07T19:42:57.7360350Z ##[command]/usr/bin/docker exec 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:58.0668481Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:58.0670008Z ##[group]Getting Git version info 2025-05-07T19:42:58.0670387Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:58.0670966Z [command]/usr/bin/git version 2025-05-07T19:42:58.0671329Z git version 2.47.1 2025-05-07T19:42:58.0672437Z ##[endgroup] 2025-05-07T19:42:58.0677693Z Temporarily overriding HOME='/__w/_temp/de73fc01-f3fd-4827-ab66-7862b3bedefa' before making global git config changes 2025-05-07T19:42:58.0678502Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:58.0679188Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:58.0702793Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:42:58.0719963Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:58.0739289Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:58.0744672Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:58.0760834Z HEAD 2025-05-07T19:42:58.0795811Z ##[endgroup] 2025-05-07T19:42:58.0796455Z [command]/usr/bin/git submodule status 2025-05-07T19:42:58.1162157Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:58.1225805Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:42:58.1318484Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:58.1389684Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:42:58.1601158Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:42:58.1674710Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:42:58.1719665Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:42:58.1735079Z ##[group]Cleaning the repository 2025-05-07T19:42:58.1737462Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:58.2620454Z Removing amdgpu-install_6.2.60204-1_all.deb 2025-05-07T19:42:58.2620975Z Removing collect_env.py 2025-05-07T19:42:58.2621276Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:42:58.2621672Z Removing fbgemm_gpu/bench/verify_fp16_stochastic_benchmark.hip 2025-05-07T19:42:58.2622135Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:42:58.2622690Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_cpu_template_hip.cpp 2025-05-07T19:42:58.2623421Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu_hip.cpp 2025-05-07T19:42:58.2624070Z Removing 
fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_hip.cpp 2025-05-07T19:42:58.2624988Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.hip 2025-05-07T19:42:58.2625730Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_host_template.hip 2025-05-07T19:42:58.2626517Z Removing fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_nbit_kernel_template.hip 2025-05-07T19:42:58.2627260Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu_hip.cpp 2025-05-07T19:42:58.2628170Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_approx_template_hip.cpp 2025-05-07T19:42:58.2628941Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_template_hip.cpp 2025-05-07T19:42:58.2629747Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_device_kernel_template_hip.cuh 2025-05-07T19:42:58.2630622Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_grad_template.hip 2025-05-07T19:42:58.2631377Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_cpu_template_hip.cpp 2025-05-07T19:42:58.2632151Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_host_template_hip.cpp 2025-05-07T19:42:58.2632928Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_indice_weights_template.hip 2025-05-07T19:42:58.2633727Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_cta_template.hip 2025-05-07T19:42:58.2634515Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_kernel_warp_template.hip 2025-05-07T19:42:58.2635281Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_meta_template_hip.cpp 2025-05-07T19:42:58.2636125Z Removing fbgemm_gpu/codegen/training/backward/embedding_backward_split_template.hip 2025-05-07T19:42:58.2636785Z Removing 
fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu_hip.cpp 2025-05-07T19:42:58.2637538Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_nobag_small_template.hip 2025-05-07T19:42:58.2638322Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_template.hip 2025-05-07T19:42:58.2639046Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_kernel_v2_template.hip 2025-05-07T19:42:58.2639764Z Removing fbgemm_gpu/codegen/training/forward/embedding_forward_split_template.hip 2025-05-07T19:42:58.2640449Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host_hip.cpp 2025-05-07T19:42:58.2641197Z Removing fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops_hip.cpp 2025-05-07T19:42:58.2641958Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_device_kernel_template_hip.cuh 2025-05-07T19:42:58.2642795Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_host_template_hip.cpp 2025-05-07T19:42:58.2643572Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_kernel_template.hip 2025-05-07T19:42:58.2644296Z Removing fbgemm_gpu/codegen/training/optimizer/embedding_optimizer_split_template.hip 2025-05-07T19:42:58.2674433Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_autograd_template_hip.cpp 2025-05-07T19:42:58.2675285Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_cpu_wrapper_template_hip.cpp 2025-05-07T19:42:58.2676170Z Removing fbgemm_gpu/codegen/training/pt2/embedding_split_host_pt2_hip_wrapper_template.cpp 2025-05-07T19:42:58.2676833Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu_hip.cpp 2025-05-07T19:42:58.2677429Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_host_hip.cpp 2025-05-07T19:42:58.2677959Z Removing fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.hip 2025-05-07T19:42:58.2678442Z Removing 
fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.hip 2025-05-07T19:42:58.2678856Z Removing fbgemm_gpu/dist/ 2025-05-07T19:42:58.2679228Z Removing fbgemm_gpu/experimental/example/src/cutlass_sgemm_nn.hip 2025-05-07T19:42:58.2679969Z Removing fbgemm_gpu/experimental/example/src/example_nccl_hip.cpp 2025-05-07T19:42:58.2680516Z Removing fbgemm_gpu/experimental/gen_ai/src/attention/gqa_attn_splitk.hip 2025-05-07T19:42:58.2681069Z Removing fbgemm_gpu/experimental/gen_ai/src/coalesce/coalesce.hip 2025-05-07T19:42:58.2681557Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car.hip 2025-05-07T19:42:58.2682000Z Removing fbgemm_gpu/experimental/gen_ai/src/comm/car_hip.cpp 2025-05-07T19:42:58.2682546Z Removing fbgemm_gpu/experimental/gen_ai/src/gather_scatter/gather_scatter.hip 2025-05-07T19:42:58.2683193Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache.hip 2025-05-07T19:42:58.2683727Z Removing fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache_hip.cpp 2025-05-07T19:42:58.2684253Z Removing fbgemm_gpu/experimental/gen_ai/src/moe/index_shuffling.hip 2025-05-07T19:42:58.2684976Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/bf16_grouped/kernels/bf16_grouped_common_hip.h 2025-05-07T19:42:58.2685897Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise/kernels/fp8_rowwise_common_hip.h 2025-05-07T19:42:58.2686873Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_batched/kernels/fp8_rowwise_batched_common_hip.h 2025-05-07T19:42:58.2687909Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common_hip.h 2025-05-07T19:42:58.2688903Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fused_moe/fused_moe_op_hip.cpp 2025-05-07T19:42:58.2689568Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cublas_utils_hip.h 2025-05-07T19:42:58.2690224Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16bf16bf16_grouped.hip 2025-05-07T19:42:58.2690920Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16.hip 2025-05-07T19:42:58.2691665Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_rowwise_batched.hip 2025-05-07T19:42:58.2692463Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/bf16i4bf16_shuffled_grouped.hip 2025-05-07T19:42:58.2693198Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16.hip 2025-05-07T19:42:58.2693950Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_f.hip 2025-05-07T19:42:58.2694783Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_128_4_1_1_t.hip 2025-05-07T19:42:58.2695629Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_f.hip 2025-05-07T19:42:58.2696455Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_192_2_2_1_t.hip 2025-05-07T19:42:58.2697297Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_f.hip 2025-05-07T19:42:58.2698140Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_128_256_2_1_1_t.hip 2025-05-07T19:42:58.2698966Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_f.hip 2025-05-07T19:42:58.2699811Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_2_1_t.hip 2025-05-07T19:42:58.2700653Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_f.hip 2025-05-07T19:42:58.2701477Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_128_2_4_1_t.hip 
2025-05-07T19:42:58.2702322Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_f.hip 2025-05-07T19:42:58.2703148Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_2_1_t.hip 2025-05-07T19:42:58.2704076Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_f.hip 2025-05-07T19:42:58.2704986Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_2_4_1_t.hip 2025-05-07T19:42:58.2705817Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_f.hip 2025-05-07T19:42:58.2706664Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_192_4_1_1_t.hip 2025-05-07T19:42:58.2707497Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_f.hip 2025-05-07T19:42:58.2708400Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_1_1_t.hip 2025-05-07T19:42:58.2709248Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_f.hip 2025-05-07T19:42:58.2710076Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_2_1_t.hip 2025-05-07T19:42:58.2710922Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_f.hip 2025-05-07T19:42:58.2711751Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_2_4_1_t.hip 2025-05-07T19:42:58.2712616Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_f.hip 2025-05-07T19:42:58.2713458Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_256_256_4_1_1_t.hip 2025-05-07T19:42:58.2714273Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_common_hip.cuh 2025-05-07T19:42:58.2715092Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f4f4bf16/f4f4bf16_manifest_hip.cuh 2025-05-07T19:42:58.2715892Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16.hip 2025-05-07T19:42:58.2716891Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_blockwise.hip 2025-05-07T19:42:58.2717628Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_cublas.hip 2025-05-07T19:42:58.2718346Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_lite.hip 2025-05-07T19:42:58.2719056Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise.hip 2025-05-07T19:42:58.2719960Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_128_128_2_1_1_t_f.hip 2025-05-07T19:42:58.2721027Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_2_1_1_f_t.hip 2025-05-07T19:42:58.2722090Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_128_256_128_4_4_1_f_t.hip 2025-05-07T19:42:58.2723141Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_128_128_1_1_1_f_f.hip 2025-05-07T19:42:58.2724172Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_16_128_1_1_1_f_f.hip 2025-05-07T19:42:58.2725213Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_1_1_1_f_f.hip 2025-05-07T19:42:58.2726258Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_256_128_2_1_1_f_f.hip 2025-05-07T19:42:58.2727284Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_32_128_2_1_1_f_f.hip 2025-05-07T19:42:58.2728335Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_64_64_128_2_1_1_f_f.hip 2025-05-07T19:42:58.2729396Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise/f8f8bf16_rowwise_common_hip.cuh 2025-05-07T19:42:58.2734805Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/common_hip.cuh 2025-05-07T19:42:58.2735965Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_cluster_size_and_transpose.hip 2025-05-07T19:42:58.2737168Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/dispatch_fp8_rowwise_batched_kernel_on_tile_size.hip 2025-05-07T19:42:58.2738219Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched.hip 2025-05-07T19:42:58.2739281Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/f8f8bf16_rowwise_batched_impl.hip 2025-05-07T19:42:58.2740236Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_batched/handle_transposition.hip 2025-05-07T19:42:58.2741077Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_rowwise_grouped.hip 2025-05-07T19:42:58.2741799Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8f8bf16_tensorwise.hip 2025-05-07T19:42:58.2742499Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_rowwise.hip 2025-05-07T19:42:58.2743185Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled.hip 2025-05-07T19:42:58.2743897Z Removing 
fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/f8i4bf16_shuffled_grouped.hip 2025-05-07T19:42:58.2744586Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16.hip 2025-05-07T19:42:58.2745231Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/i8i8bf16_dynamic.hip 2025-05-07T19:42:58.2746011Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/include/fp8_blockwise_cutlass_helpers_hip.h 2025-05-07T19:42:58.2747163Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions/mixed_dtype_utils.hip 2025-05-07T19:42:58.2747926Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16_fast_gemv.hip 2025-05-07T19:42:58.2748598Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/bf16fp8bf16_fast_gemv.hip 2025-05-07T19:42:58.2749277Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/fp8fp8bf16_fast_gemv.hip 2025-05-07T19:42:58.2749960Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv.hip 2025-05-07T19:42:58.2750647Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/fast_gemv_hip.cuh 2025-05-07T19:42:58.2751328Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/fast_gemv/include/utility_hip.cuh 2025-05-07T19:42:58.2751923Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.hip 2025-05-07T19:42:58.2752446Z Removing fbgemm_gpu/experimental/gen_ai/src/quantize/quantize_hip.cpp 2025-05-07T19:42:58.2752921Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:42:58.2753298Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:42:58.2753728Z Removing fbgemm_gpu/include/fbgemm_gpu/cumem_utils_hip.h 2025-05-07T19:42:58.2754299Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_backward_template_helpers_hip.cuh 2025-05-07T19:42:58.2754903Z Removing fbgemm_gpu/include/fbgemm_gpu/embedding_forward_split_cpu_hip.h 2025-05-07T19:42:58.2755687Z Removing 
fbgemm_gpu/include/fbgemm_gpu/embedding_forward_template_helpers_hip.cuh 2025-05-07T19:42:58.2756340Z Removing fbgemm_gpu/include/fbgemm_gpu/layout_transform_ops_hip.cuh 2025-05-07T19:42:58.2756917Z Removing fbgemm_gpu/include/fbgemm_gpu/permute_multi_embedding_function_hip.h 2025-05-07T19:42:58.2757474Z Removing fbgemm_gpu/include/fbgemm_gpu/quantize_ops_hip.cuh 2025-05-07T19:42:58.2757920Z Removing fbgemm_gpu/include/fbgemm_gpu/sparse_ops_hip.cuh 2025-05-07T19:42:58.2758421Z Removing fbgemm_gpu/include/fbgemm_gpu/split_embeddings_utils_hip.cuh 2025-05-07T19:42:58.2758959Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/barrier_isolation_hip.cuh 2025-05-07T19:42:58.2759749Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bench_utils_hip.cuh 2025-05-07T19:42:58.2760251Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/bitonic_sort_hip.cuh 2025-05-07T19:42:58.2760805Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_postfix_hip.cuh 2025-05-07T19:42:58.2761403Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/cub_namespace_prefix_hip.cuh 2025-05-07T19:42:58.2761971Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_cache_flusher_hip.cuh 2025-05-07T19:42:58.2762536Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/device_properties_hip.cuh 2025-05-07T19:42:58.2763129Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/dispatch_macros_hip.h 2025-05-07T19:42:58.2763719Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/embedding_bounds_check_common_hip.cuh 2025-05-07T19:42:58.2764289Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/find_qparams_hip.cuh 2025-05-07T19:42:58.2764777Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/float_hip.cuh 2025-05-07T19:42:58.2765248Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/hip_prelude.cuh 2025-05-07T19:42:58.2765773Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/host_device_buffer_pair_hip.cuh 2025-05-07T19:42:58.2766357Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/inclusive_sum_scan_hip.cuh 2025-05-07T19:42:58.2766890Z Removing 
fbgemm_gpu/include/fbgemm_gpu/utils/kernel_launcher_hip.cuh 2025-05-07T19:42:58.2767461Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/stochastic_rounding_hip.h 2025-05-07T19:42:58.2768206Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/vec2_hip.h 2025-05-07T19:42:58.2768674Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/rocm/weight_row_hip.h 2025-05-07T19:42:58.2769158Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/shared_memory_hip.cuh 2025-05-07T19:42:58.2769653Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding_hip.cuh 2025-05-07T19:42:58.2770184Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_builder_hip.h 2025-05-07T19:42:58.2770684Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/tensor_accessor_hip.h 2025-05-07T19:42:58.2771137Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4_hip.cuh 2025-05-07T19:42:58.2771563Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec4acc_hip.cuh 2025-05-07T19:42:58.2771988Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vec_quant_hip.cuh 2025-05-07T19:42:58.2772417Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/vecn_hip.cuh 2025-05-07T19:42:58.2772840Z Removing fbgemm_gpu/include/fbgemm_gpu/utils/weight_row_hip.cuh 2025-05-07T19:42:58.2773346Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_hip.h 2025-05-07T19:42:58.2773912Z Removing fbgemm_gpu/src/dram_kv_embedding_cache/dram_kv_embedding_cache_wrapper_hip.h 2025-05-07T19:42:58.2774489Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.hip 2025-05-07T19:42:58.2775064Z Removing fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu_hip.cpp 2025-05-07T19:42:58.2775579Z Removing fbgemm_gpu/src/histogram_binning_calibration_ops.hip 2025-05-07T19:42:58.2776021Z Removing fbgemm_gpu/src/input_combine_ops/input_combine.hip 2025-05-07T19:42:58.2776466Z Removing fbgemm_gpu/src/input_combine_ops/input_combine_cpu_hip.cpp 2025-05-07T19:42:58.2777043Z Removing 
fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.hip 2025-05-07T19:42:58.2777717Z Removing fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu_hip.cpp 2025-05-07T19:42:58.2778387Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.hip 2025-05-07T19:42:58.2779006Z Removing fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.hip 2025-05-07T19:42:58.2779511Z Removing fbgemm_gpu/src/jagged_tensor_ops/common_hip.cuh 2025-05-07T19:42:58.2779967Z Removing fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.hip 2025-05-07T19:42:58.2780464Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.hip 2025-05-07T19:42:58.2781088Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.hip 2025-05-07T19:42:58.2781845Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.hip 2025-05-07T19:42:58.2782435Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.hip 2025-05-07T19:42:58.2783002Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.hip 2025-05-07T19:42:58.2783537Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.hip 2025-05-07T19:42:58.2784076Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.hip 2025-05-07T19:42:58.2784628Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.hip 2025-05-07T19:42:58.2785127Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.hip 2025-05-07T19:42:58.2785606Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.hip 2025-05-07T19:42:58.2786081Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu_hip.cpp 2025-05-07T19:42:58.2786622Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.hip 2025-05-07T19:42:58.2787169Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.hip 
2025-05-07T19:42:58.2787690Z Removing fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.hip 2025-05-07T19:42:58.2788204Z Removing fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.hip 2025-05-07T19:42:58.2788745Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.hip 2025-05-07T19:42:58.2789283Z Removing fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu_hip.cpp 2025-05-07T19:42:58.2789743Z Removing fbgemm_gpu/src/memory_utils/common_hip.cuh 2025-05-07T19:42:58.2790134Z Removing fbgemm_gpu/src/memory_utils/memory_utils.hip 2025-05-07T19:42:58.2790520Z Removing fbgemm_gpu/src/memory_utils/memory_utils_hip.cpp 2025-05-07T19:42:58.2790934Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops.hip 2025-05-07T19:42:58.2791360Z Removing fbgemm_gpu/src/memory_utils/memory_utils_ops_hip.cpp 2025-05-07T19:42:58.2791887Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:58.2792528Z Removing fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu_hip.cpp 2025-05-07T19:42:58.2793024Z Removing fbgemm_gpu/src/metric_ops/metric_ops.hip 2025-05-07T19:42:58.2793541Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function_hip.cpp 2025-05-07T19:42:58.2794165Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.hip 2025-05-07T19:42:58.2794797Z Removing fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:58.2795441Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.hip 2025-05-07T19:42:58.2796154Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu_hip.cpp 2025-05-07T19:42:58.2797048Z Removing fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.hip 2025-05-07T19:42:58.2797780Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_split_table_batched_embeddings_hip.cpp 
2025-05-07T19:42:58.2798454Z Removing fbgemm_gpu/src/ps_split_embeddings_cache/ps_table_batched_embeddings_hip.h 2025-05-07T19:42:58.2798984Z Removing fbgemm_gpu/src/quantize_ops/common_hip.cuh 2025-05-07T19:42:58.2799385Z Removing fbgemm_gpu/src/quantize_ops/mx/common_hip.cuh 2025-05-07T19:42:58.2799813Z Removing fbgemm_gpu/src/quantize_ops/mx_common_hip.cuh 2025-05-07T19:42:58.2800240Z Removing fbgemm_gpu/src/quantize_ops/quantize_bfloat16.hip 2025-05-07T19:42:58.2800709Z Removing fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.hip 2025-05-07T19:42:58.2801220Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.hip 2025-05-07T19:42:58.2801756Z Removing fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.hip 2025-05-07T19:42:58.2802239Z Removing fbgemm_gpu/src/quantize_ops/quantize_hfp8.hip 2025-05-07T19:42:58.2802648Z Removing fbgemm_gpu/src/quantize_ops/quantize_msfp.hip 2025-05-07T19:42:58.2803064Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx.hip 2025-05-07T19:42:58.2803566Z Removing fbgemm_gpu/src/quantize_ops/quantize_mx_hip.cuh 2025-05-07T19:42:58.2804028Z Removing fbgemm_gpu/src/quantize_ops/quantize_ops_cpu_hip.cpp 2025-05-07T19:42:58.2804521Z Removing fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.hip 2025-05-07T19:42:58.2804989Z Removing fbgemm_gpu/src/sparse_ops/common_hip.cuh 2025-05-07T19:42:58.2805449Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.hip 2025-05-07T19:42:58.2805972Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum_hip.cpp 2025-05-07T19:42:58.2806542Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.hip 2025-05-07T19:42:58.2807000Z Removing fbgemm_gpu/src/sparse_ops/sparse_async_cumsum_hip.cpp 2025-05-07T19:42:58.2807515Z Removing fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.hip 2025-05-07T19:42:58.2808053Z Removing fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.hip 2025-05-07T19:42:58.2808689Z Removing 
fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.hip 2025-05-07T19:42:58.2809190Z Removing fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.hip 2025-05-07T19:42:58.2809703Z Removing fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.hip 2025-05-07T19:42:58.2810178Z Removing fbgemm_gpu/src/sparse_ops/sparse_group_index.hip 2025-05-07T19:42:58.2810575Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_add.hip 2025-05-07T19:42:58.2810988Z Removing fbgemm_gpu/src/sparse_ops/sparse_index_select.hip 2025-05-07T19:42:58.2811403Z Removing fbgemm_gpu/src/sparse_ops/sparse_invert_permute.hip 2025-05-07T19:42:58.2811833Z Removing fbgemm_gpu/src/sparse_ops/sparse_ops_cpu_hip.cpp 2025-05-07T19:42:58.2812286Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.hip 2025-05-07T19:42:58.2812763Z Removing fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.hip 2025-05-07T19:42:58.2813216Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute102.hip 2025-05-07T19:42:58.2813617Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_1d.hip 2025-05-07T19:42:58.2814031Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_2d.hip 2025-05-07T19:42:58.2814452Z Removing fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.hip 2025-05-07T19:42:58.2814874Z Removing fbgemm_gpu/src/sparse_ops/sparse_range.hip 2025-05-07T19:42:58.2815290Z Removing fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.hip 2025-05-07T19:42:58.2815731Z Removing fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.hip 2025-05-07T19:42:58.2816137Z Removing fbgemm_gpu/src/sparse_ops/sparse_zipf.hip 2025-05-07T19:42:58.2816564Z Removing fbgemm_gpu/src/split_embeddings_cache/cachelib_cache_hip.cpp 2025-05-07T19:42:58.2817042Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.cuh 2025-05-07T19:42:58.2817463Z Removing fbgemm_gpu/src/split_embeddings_cache/common_hip.h 2025-05-07T19:42:58.2817918Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.hip 2025-05-07T19:42:58.2818414Z 
Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.hip 2025-05-07T19:42:58.2818929Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.hip 2025-05-07T19:42:58.2819483Z Removing fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte_hip.cpp 2025-05-07T19:42:58.2820022Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.hip 2025-05-07T19:42:58.2820673Z Removing fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices_hip.cpp 2025-05-07T19:42:58.2821199Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.hip 2025-05-07T19:42:58.2821674Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.hip 2025-05-07T19:42:58.2822198Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.hip 2025-05-07T19:42:58.2822733Z Removing fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte_hip.cpp 2025-05-07T19:42:58.2823243Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache.hip 2025-05-07T19:42:58.2823679Z Removing fbgemm_gpu/src/split_embeddings_cache/lxu_cache_hip.cpp 2025-05-07T19:42:58.2824286Z Removing fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.hip 2025-05-07T19:42:58.2824840Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.hip 2025-05-07T19:42:58.2825409Z Removing fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops_hip.cpp 2025-05-07T19:42:58.2825973Z Removing fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.hip 2025-05-07T19:42:58.2826479Z Removing fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.hip 2025-05-07T19:42:58.2826984Z Removing fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.hip 2025-05-07T19:42:58.2827551Z Removing fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_hip.cpp 2025-05-07T19:42:58.2828108Z Removing fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.hip 2025-05-07T19:42:58.2828679Z Removing 
fbgemm_gpu/src/ssd_split_embeddings_cache/embedding_rocksdb_wrapper_hip.h 2025-05-07T19:42:58.2829216Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.cpp 2025-05-07T19:42:58.2829717Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_hip_utils.h 2025-05-07T19:42:58.2830271Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.cpp 2025-05-07T19:42:58.2830901Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_db_table_batched_embeddings_hip.h 2025-05-07T19:42:58.2831502Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/kv_tensor_wrapper_cpu_hip.cpp 2025-05-07T19:42:58.2832089Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_scratch_pad_indices_queue_hip.cpp 2025-05-07T19:42:58.2832710Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_embeddings_cache_hip.hip 2025-05-07T19:42:58.2833338Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_table_batched_embeddings_hip.cpp 2025-05-07T19:42:58.2833980Z Removing fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_table_batched_embeddings_hip.h 2025-05-07T19:42:58.2834456Z Removing fbgemm_gpu/src/topology_utils_hip.cpp 2025-05-07T19:42:58.2834845Z Removing fbgemm_gpu/test/tbe/utils/cpu_kernel_test_hip.cpp 2025-05-07T19:42:58.2835263Z Removing fbgemm_gpu/test/utils/kernel_launcher_test.hip 2025-05-07T19:42:58.2835666Z Removing fbgemm_gpu/test/utils/stochastic_rounding_test.hip 2025-05-07T19:42:58.2836170Z Removing fbgemm_gpu/test/utils/tensor_accessor2_test.hip 2025-05-07T19:42:58.2836793Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_test.hip 2025-05-07T19:42:58.2837345Z Removing fbgemm_gpu/test/utils/tensor_accessor_builder_with_memcheck_test.hip 2025-05-07T19:42:58.2837853Z Removing fbgemm_gpu/test/utils/tensor_accessor_test.hip 2025-05-07T19:42:58.2838345Z Removing fbgemm_gpu/test/utils/tensor_accessor_with_memcheck_test.hip 2025-05-07T19:42:58.2838818Z Removing fbgemm_gpu/test/utils/weight_row_test.hip 
2025-05-07T19:42:58.2840958Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:58.3749410Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:58.3753555Z ##[endgroup] 2025-05-07T19:42:58.3755057Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:58.3761882Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:58.3790284Z ##[endgroup] 2025-05-07T19:42:58.3790722Z ##[group]Setting up auth 2025-05-07T19:42:58.3795280Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:42:58.3821206Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:58.4126653Z Entering 'external/asmjit' 2025-05-07T19:42:58.4188161Z Entering 'external/composable_kernel' 2025-05-07T19:42:58.4250243Z Entering 'external/cpuinfo' 2025-05-07T19:42:58.4308688Z Entering 'external/cutlass' 2025-05-07T19:42:58.4375636Z Entering 'external/googletest' 2025-05-07T19:42:58.4437930Z Entering 'external/hipify_torch' 2025-05-07T19:42:58.4493349Z Entering 'external/json' 2025-05-07T19:42:58.4579474Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:58.4608429Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:58.4901437Z Entering 'external/asmjit' 2025-05-07T19:42:58.4956272Z Entering 'external/composable_kernel' 2025-05-07T19:42:58.5021870Z Entering 'external/cpuinfo' 2025-05-07T19:42:58.5077814Z Entering 'external/cutlass' 2025-05-07T19:42:58.5142259Z Entering 'external/googletest' 2025-05-07T19:42:58.5198831Z Entering 'external/hipify_torch' 
2025-05-07T19:42:58.5254659Z Entering 'external/json' 2025-05-07T19:42:58.5325590Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:58.5361155Z ##[endgroup] 2025-05-07T19:42:58.5361562Z ##[group]Fetching the repository 2025-05-07T19:42:58.5367854Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:58.7117086Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:42:58.7117704Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:58.7133289Z ##[endgroup] 2025-05-07T19:42:58.7133684Z ##[group]Determining the checkout info 2025-05-07T19:42:58.7134139Z ##[endgroup] 2025-05-07T19:42:58.7136474Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:58.7650691Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:58.7675797Z ##[group]Checking out the ref 2025-05-07T19:42:58.7676357Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:58.8673811Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:58.8674308Z any of your branches: 2025-05-07T19:42:58.8674773Z 2025-05-07T19:42:58.8675313Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:58.8675800Z 2025-05-07T19:42:58.8676124Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:58.8676528Z to do so with: 2025-05-07T19:42:58.8676659Z 2025-05-07T19:42:58.8676807Z git branch 1c9ad64 2025-05-07T19:42:58.8677017Z 2025-05-07T19:42:58.8677426Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:58.8678862Z ##[endgroup] 2025-05-07T19:42:58.8679366Z ##[group]Setting 
up auth for fetching submodules 2025-05-07T19:42:58.8681795Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:58.8721635Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:58.8744478Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:58.8770249Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:58.8792738Z ##[endgroup] 2025-05-07T19:42:58.8793485Z ##[group]Fetching submodules 2025-05-07T19:42:58.8793878Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:58.9151941Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:58.9153321Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:58.9154472Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:58.9154860Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:58.9155274Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:58.9155695Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:58.9156240Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:58.9172847Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:42:58.9986958Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:42:59.2911283Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:42:59.3926154Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:00.0552789Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:00.0986025Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 
2025-05-07T19:43:00.1071491Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:00.2201686Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:00.2211673Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:00.2496514Z Entering 'external/asmjit' 2025-05-07T19:43:00.2518908Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.2556660Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.2592429Z Entering 'external/cutlass' 2025-05-07T19:43:00.2620375Z Entering 'external/googletest' 2025-05-07T19:43:00.2654769Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.2681426Z Entering 'external/json' 2025-05-07T19:43:00.2715946Z ##[endgroup] 2025-05-07T19:43:00.2716438Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:00.2718690Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:00.2992394Z Entering 'external/asmjit' 2025-05-07T19:43:00.3038466Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.3109118Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.3160866Z Entering 'external/cutlass' 2025-05-07T19:43:00.3235434Z Entering 'external/googletest' 2025-05-07T19:43:00.3296208Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.3360137Z Entering 'external/json' 2025-05-07T19:43:00.3431983Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:00.3703023Z Entering 'external/asmjit' 2025-05-07T19:43:00.3761760Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:00.3763738Z Entering 
'external/composable_kernel' 2025-05-07T19:43:00.3820601Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:00.3822199Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.3877712Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:00.3878260Z Entering 'external/cutlass' 2025-05-07T19:43:00.3935997Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:00.3938042Z Entering 'external/googletest' 2025-05-07T19:43:00.3998054Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:00.3998606Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.4051488Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:00.4054108Z Entering 'external/json' 2025-05-07T19:43:00.4107375Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:00.4184346Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:00.4497184Z Entering 'external/asmjit' 2025-05-07T19:43:00.4519009Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.4549611Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.4585555Z Entering 'external/cutlass' 2025-05-07T19:43:00.4619400Z Entering 'external/googletest' 2025-05-07T19:43:00.4648114Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.4676046Z Entering 'external/json' 2025-05-07T19:43:00.4711795Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:00.4998202Z Entering 'external/asmjit' 2025-05-07T19:43:00.5023389Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.5043013Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.5071633Z Entering 'external/cutlass' 2025-05-07T19:43:00.5096787Z Entering 'external/googletest' 
2025-05-07T19:43:00.5128349Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.5152538Z Entering 'external/json' 2025-05-07T19:43:00.5188803Z ##[endgroup] 2025-05-07T19:43:00.5216328Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:00.5237637Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:00.5370359Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:00.5370729Z . $PRELUDE; print_system_info 2025-05-07T19:43:00.5371192Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:00.5371509Z env: 2025-05-07T19:43:00.5371715Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:00.5372016Z BUILD_ENV: build_binary 2025-05-07T19:43:00.5372242Z BUILD_TARGET: default 2025-05-07T19:43:00.5372471Z BUILD_VARIANT: cuda 2025-05-07T19:43:00.5372693Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:00.5372938Z ##[endgroup] 2025-05-07T19:43:01.0288513Z ################################################################################ 2025-05-07T19:43:01.0289048Z # Print System Info 2025-05-07T19:43:01.0289714Z # 2025-05-07T19:43:01.0315402Z # [2025-05-07T19:43:01.030Z] + print_system_info 2025-05-07T19:43:01.0316030Z ################################################################################ 2025-05-07T19:43:01.0316297Z 2025-05-07T19:43:01.0316695Z ################################################################################ 2025-05-07T19:43:01.0317039Z [INFO] Printing environment variables ... 
2025-05-07T19:43:01.0317444Z + printenv 2025-05-07T19:43:01.0317566Z 2025-05-07T19:43:01.0328591Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:01.0329413Z BUILD_VARIANT=cuda 2025-05-07T19:43:01.0329755Z HOSTNAME=12a11cea79f2 2025-05-07T19:43:01.0330194Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_51dfb36f-b560-44b0-ac28-52b081544f21 2025-05-07T19:43:01.0330687Z GITHUB_ACTION=__run_2 2025-05-07T19:43:01.0330921Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:01.0331183Z RUNNER_NAME=i-053f9a24237032a22 2025-05-07T19:43:01.0331462Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:01.0331799Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:01.0332066Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:01.0332321Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:01.0332609Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:01.0332904Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:01.0333364Z *** 2025-05-07T19:43:01.0333566Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:01.0333842Z GITHUB_ACTIONS=true 2025-05-07T19:43:01.0334109Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:01.0334688Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:01.0335238Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:01.0335510Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:01.0335780Z RUNNER_OS=Linux 2025-05-07T19:43:01.0336003Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:01.0336260Z HOME=/github/home 2025-05-07T19:43:01.0336505Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:01.0336832Z RUNNER_ARCH=X64 2025-05-07T19:43:01.0337051Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:01.0337310Z BUILD_TARGET=default 2025-05-07T19:43:01.0337728Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_51dfb36f-b560-44b0-ac28-52b081544f21 2025-05-07T19:43:01.0338386Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_51dfb36f-b560-44b0-ac28-52b081544f21 2025-05-07T19:43:01.0338879Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:01.0339222Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:01.0339488Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:01.0339972Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_51dfb36f-b560-44b0-ac28-52b081544f21 2025-05-07T19:43:01.0340620Z BUILD_ENV=build_binary 2025-05-07T19:43:01.0340847Z GITHUB_ACTOR=q10 2025-05-07T19:43:01.0341081Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:01.0341303Z KERN_NAME_LC=linux 2025-05-07T19:43:01.0341541Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:01.0341942Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:01.0342285Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:01.0342809Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:01.0343093Z SHLVL=1 2025-05-07T19:43:01.0343282Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:01.0343527Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:01.0344002Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:01.0344605Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:01.0344860Z KERN_NAME=Linux 2025-05-07T19:43:01.0345080Z GITHUB_JOB=build_artifact 2025-05-07T19:43:01.0345520Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:01.0345802Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:01.0346069Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:01.0346332Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:01.0346874Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:01.0347252Z GITHUB_BASE_REF=main 2025-05-07T19:43:01.0347604Z CI=true 2025-05-07T19:43:01.0347878Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:01.0348179Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:01.0348476Z GITHUB_ACTION_REF= 2025-05-07T19:43:01.0348722Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:01.0349226Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_51dfb36f-b560-44b0-ac28-52b081544f21 2025-05-07T19:43:01.0349704Z MACHINE_NAME=x86_64 2025-05-07T19:43:01.0349946Z 
_=/usr/bin/printenv 2025-05-07T19:43:01.0350083Z 2025-05-07T19:43:01.0350200Z ################################################################################ 2025-05-07T19:43:01.0350542Z [INFO] Print ldd version ... 2025-05-07T19:43:01.0350802Z + ldd --version 2025-05-07T19:43:01.0350952Z 2025-05-07T19:43:01.0351063Z ldd (GNU libc) 2.34 2025-05-07T19:43:01.0351354Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:01.0351807Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:01.0352374Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:01.0352834Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:01.0353079Z 2025-05-07T19:43:01.0353198Z ################################################################################ 2025-05-07T19:43:01.0353533Z [INFO] Print CPU info ... 2025-05-07T19:43:01.0353770Z + nproc 2025-05-07T19:43:01.0353882Z 2025-05-07T19:43:01.0359093Z 96 2025-05-07T19:43:01.0360092Z 2025-05-07T19:43:01.0360396Z + lscpu 2025-05-07T19:43:01.0625524Z 2025-05-07T19:43:01.0626051Z Architecture: x86_64 2025-05-07T19:43:01.0627217Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:01.0628401Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0629590Z Byte Order: Little Endian 2025-05-07T19:43:01.0630520Z CPU(s): 96 2025-05-07T19:43:01.0631401Z On-line CPU(s) list: 0-95 2025-05-07T19:43:01.0632329Z Vendor ID: GenuineIntel 2025-05-07T19:43:01.0633296Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0633685Z CPU family: 6 2025-05-07T19:43:01.0633952Z Model: 85 2025-05-07T19:43:01.0634237Z Thread(s) per core: 2 2025-05-07T19:43:01.0634519Z Core(s) per socket: 24 2025-05-07T19:43:01.0634804Z Socket(s): 2 2025-05-07T19:43:01.0635066Z Stepping: 7 2025-05-07T19:43:01.0635361Z BogoMIPS: 5999.98 2025-05-07T19:43:01.0638002Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.0640635Z Hypervisor vendor: KVM 2025-05-07T19:43:01.0641109Z Virtualization type: full 2025-05-07T19:43:01.0641484Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:01.0641872Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:01.0642266Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:01.0642663Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:01.0643003Z NUMA node(s): 2 2025-05-07T19:43:01.0643324Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:01.0643672Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:01.0644158Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:01.0644733Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:01.0645336Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:01.0645986Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:01.0646774Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:01.0647432Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:01.0648071Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:01.0648497Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:01.0648886Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:01.0649306Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:01.0649923Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:01.0650788Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:01.0651511Z Vulnerability Srbds: Not affected 2025-05-07T19:43:01.0651916Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:01.0652203Z 2025-05-07T19:43:01.0652306Z + cat /proc/cpuinfo 2025-05-07T19:43:01.0652463Z 2025-05-07T19:43:01.0653041Z processor : 0 2025-05-07T19:43:01.0653278Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0653569Z cpu family : 6 2025-05-07T19:43:01.0653798Z model : 85 2025-05-07T19:43:01.0654129Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0654497Z stepping : 7 2025-05-07T19:43:01.0654763Z microcode : 0x5003901 2025-05-07T19:43:01.0655011Z cpu MHz : 3242.446 2025-05-07T19:43:01.0655278Z cache size : 36608 KB 2025-05-07T19:43:01.0655522Z physical id : 0 2025-05-07T19:43:01.0655788Z siblings : 48 2025-05-07T19:43:01.0656044Z core id : 0 2025-05-07T19:43:01.0656271Z cpu cores : 24 2025-05-07T19:43:01.0656524Z apicid : 0 2025-05-07T19:43:01.0656753Z initial apicid : 0 2025-05-07T19:43:01.0657018Z fpu : yes 2025-05-07T19:43:01.0657234Z fpu_exception : yes 2025-05-07T19:43:01.0657497Z cpuid level : 13 2025-05-07T19:43:01.0657728Z wp : yes 2025-05-07T19:43:01.0660036Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0662745Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0663468Z bogomips : 5999.98 2025-05-07T19:43:01.0663708Z clflush size : 64 2025-05-07T19:43:01.0663932Z cache_alignment : 64 2025-05-07T19:43:01.0664226Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0664650Z power management: 2025-05-07T19:43:01.0664786Z 2025-05-07T19:43:01.0664877Z processor : 1 2025-05-07T19:43:01.0665109Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0665351Z cpu family : 6 2025-05-07T19:43:01.0665572Z model : 85 2025-05-07T19:43:01.0665850Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0666235Z stepping : 7 2025-05-07T19:43:01.0666443Z microcode : 0x5003901 2025-05-07T19:43:01.0666683Z cpu MHz : 1601.124 2025-05-07T19:43:01.0666901Z cache size : 36608 KB 2025-05-07T19:43:01.0667140Z physical id : 0 2025-05-07T19:43:01.0667367Z siblings : 48 2025-05-07T19:43:01.0667568Z core id : 1 2025-05-07T19:43:01.0667786Z cpu cores : 24 2025-05-07T19:43:01.0667986Z apicid : 2 2025-05-07T19:43:01.0668203Z initial apicid : 2 2025-05-07T19:43:01.0668413Z fpu : yes 2025-05-07T19:43:01.0668624Z fpu_exception : yes 2025-05-07T19:43:01.0668846Z cpuid level : 13 2025-05-07T19:43:01.0669065Z wp : yes 2025-05-07T19:43:01.0671310Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0673931Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0674528Z bogomips : 5999.98 2025-05-07T19:43:01.0674766Z clflush size : 64 2025-05-07T19:43:01.0675024Z cache_alignment : 64 2025-05-07T19:43:01.0675337Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0675678Z power management: 2025-05-07T19:43:01.0675901Z 2025-05-07T19:43:01.0676039Z processor : 2 2025-05-07T19:43:01.0676272Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0676561Z cpu family : 6 2025-05-07T19:43:01.0676855Z model : 85 2025-05-07T19:43:01.0677175Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0677550Z stepping : 7 2025-05-07T19:43:01.0677802Z microcode : 0x5003901 2025-05-07T19:43:01.0678047Z cpu MHz : 3324.298 2025-05-07T19:43:01.0678306Z cache size : 36608 KB 2025-05-07T19:43:01.0678546Z physical id : 0 2025-05-07T19:43:01.0678803Z siblings : 48 2025-05-07T19:43:01.0679050Z core id : 2 2025-05-07T19:43:01.0679268Z cpu cores : 24 2025-05-07T19:43:01.0679516Z apicid : 4 2025-05-07T19:43:01.0679736Z initial apicid : 4 2025-05-07T19:43:01.0679992Z fpu : yes 2025-05-07T19:43:01.0680216Z fpu_exception : yes 2025-05-07T19:43:01.0680478Z cpuid level : 13 2025-05-07T19:43:01.0680704Z wp : yes 2025-05-07T19:43:01.0682992Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0685626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0686224Z bogomips : 5999.98 2025-05-07T19:43:01.0686477Z clflush size : 64 2025-05-07T19:43:01.0686797Z cache_alignment : 64 2025-05-07T19:43:01.0687114Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0687567Z power management: 2025-05-07T19:43:01.0687707Z 2025-05-07T19:43:01.0687803Z processor : 3 2025-05-07T19:43:01.0688124Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0688398Z cpu family : 6 2025-05-07T19:43:01.0688656Z model : 85 2025-05-07T19:43:01.0688961Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0689362Z stepping : 7 2025-05-07T19:43:01.0689627Z microcode : 0x5003901 2025-05-07T19:43:01.0689876Z cpu MHz : 3790.566 2025-05-07T19:43:01.0690143Z cache size : 36608 KB 2025-05-07T19:43:01.0690389Z physical id : 0 2025-05-07T19:43:01.0690659Z siblings : 48 2025-05-07T19:43:01.0690887Z core id : 3 2025-05-07T19:43:01.0691140Z cpu cores : 24 2025-05-07T19:43:01.0691372Z apicid : 6 2025-05-07T19:43:01.0691624Z initial apicid : 6 2025-05-07T19:43:01.0691862Z fpu : yes 2025-05-07T19:43:01.0692118Z fpu_exception : yes 2025-05-07T19:43:01.0692365Z cpuid level : 13 2025-05-07T19:43:01.0692626Z wp : yes 2025-05-07T19:43:01.0694926Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0697559Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0698187Z bogomips : 5999.98 2025-05-07T19:43:01.0698454Z clflush size : 64 2025-05-07T19:43:01.0698702Z cache_alignment : 64 2025-05-07T19:43:01.0699021Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0699371Z power management: 2025-05-07T19:43:01.0699546Z 2025-05-07T19:43:01.0699646Z processor : 4 2025-05-07T19:43:01.0699883Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0700168Z cpu family : 6 2025-05-07T19:43:01.0700387Z model : 85 2025-05-07T19:43:01.0700698Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0701088Z stepping : 7 2025-05-07T19:43:01.0701314Z microcode : 0x5003901 2025-05-07T19:43:01.0701576Z cpu MHz : 3187.256 2025-05-07T19:43:01.0701811Z cache size : 36608 KB 2025-05-07T19:43:01.0702074Z physical id : 0 2025-05-07T19:43:01.0702301Z siblings : 48 2025-05-07T19:43:01.0702542Z core id : 4 2025-05-07T19:43:01.0702759Z cpu cores : 24 2025-05-07T19:43:01.0703002Z apicid : 8 2025-05-07T19:43:01.0703221Z initial apicid : 8 2025-05-07T19:43:01.0703477Z fpu : yes 2025-05-07T19:43:01.0703708Z fpu_exception : yes 2025-05-07T19:43:01.0703972Z cpuid level : 13 2025-05-07T19:43:01.0704208Z wp : yes 2025-05-07T19:43:01.0706496Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0709140Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0709760Z bogomips : 5999.98 2025-05-07T19:43:01.0709996Z clflush size : 64 2025-05-07T19:43:01.0710259Z cache_alignment : 64 2025-05-07T19:43:01.0710549Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0711004Z power management: 2025-05-07T19:43:01.0711152Z 2025-05-07T19:43:01.0711252Z processor : 5 2025-05-07T19:43:01.0711516Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0711778Z cpu family : 6 2025-05-07T19:43:01.0712098Z model : 85 2025-05-07T19:43:01.0712393Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0712781Z stepping : 7 2025-05-07T19:43:01.0713031Z microcode : 0x5003901 2025-05-07T19:43:01.0713273Z cpu MHz : 3216.596 2025-05-07T19:43:01.0713536Z cache size : 36608 KB 2025-05-07T19:43:01.0713784Z physical id : 0 2025-05-07T19:43:01.0714038Z siblings : 48 2025-05-07T19:43:01.0714245Z core id : 5 2025-05-07T19:43:01.0714474Z cpu cores : 24 2025-05-07T19:43:01.0714695Z apicid : 10 2025-05-07T19:43:01.0714914Z initial apicid : 10 2025-05-07T19:43:01.0715135Z fpu : yes 2025-05-07T19:43:01.0715378Z fpu_exception : yes 2025-05-07T19:43:01.0715598Z cpuid level : 13 2025-05-07T19:43:01.0715943Z wp : yes 2025-05-07T19:43:01.0718243Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0720860Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0721489Z bogomips : 5999.98 2025-05-07T19:43:01.0721757Z clflush size : 64 2025-05-07T19:43:01.0721986Z cache_alignment : 64 2025-05-07T19:43:01.0722410Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0722741Z power management: 2025-05-07T19:43:01.0722907Z 2025-05-07T19:43:01.0722991Z processor : 6 2025-05-07T19:43:01.0723197Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0723444Z cpu family : 6 2025-05-07T19:43:01.0723642Z model : 85 2025-05-07T19:43:01.0723927Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0724282Z stepping : 7 2025-05-07T19:43:01.0724482Z microcode : 0x5003901 2025-05-07T19:43:01.0724716Z cpu MHz : 2999.994 2025-05-07T19:43:01.0724926Z cache size : 36608 KB 2025-05-07T19:43:01.0725159Z physical id : 0 2025-05-07T19:43:01.0725365Z siblings : 48 2025-05-07T19:43:01.0725577Z core id : 6 2025-05-07T19:43:01.0725772Z cpu cores : 24 2025-05-07T19:43:01.0725989Z apicid : 12 2025-05-07T19:43:01.0726191Z initial apicid : 12 2025-05-07T19:43:01.0726417Z fpu : yes 2025-05-07T19:43:01.0726614Z fpu_exception : yes 2025-05-07T19:43:01.0726838Z cpuid level : 13 2025-05-07T19:43:01.0727043Z wp : yes 2025-05-07T19:43:01.0729247Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0731798Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0732377Z bogomips : 5999.98 2025-05-07T19:43:01.0732588Z clflush size : 64 2025-05-07T19:43:01.0732812Z cache_alignment : 64 2025-05-07T19:43:01.0733077Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0733408Z power management: 2025-05-07T19:43:01.0733538Z 2025-05-07T19:43:01.0733702Z processor : 7 2025-05-07T19:43:01.0733934Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0734171Z cpu family : 6 2025-05-07T19:43:01.0734392Z model : 85 2025-05-07T19:43:01.0734667Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0735090Z stepping : 7 2025-05-07T19:43:01.0735311Z microcode : 0x5003901 2025-05-07T19:43:01.0735535Z cpu MHz : 2999.994 2025-05-07T19:43:01.0735769Z cache size : 36608 KB 2025-05-07T19:43:01.0735996Z physical id : 0 2025-05-07T19:43:01.0736224Z siblings : 48 2025-05-07T19:43:01.0736421Z core id : 7 2025-05-07T19:43:01.0757282Z cpu cores : 24 2025-05-07T19:43:01.0757546Z apicid : 14 2025-05-07T19:43:01.0757760Z initial apicid : 14 2025-05-07T19:43:01.0757999Z fpu : yes 2025-05-07T19:43:01.0758203Z fpu_exception : yes 2025-05-07T19:43:01.0758443Z cpuid level : 13 2025-05-07T19:43:01.0758653Z wp : yes 2025-05-07T19:43:01.0760952Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0763594Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0764180Z bogomips : 5999.98 2025-05-07T19:43:01.0764587Z clflush size : 64 2025-05-07T19:43:01.0764808Z cache_alignment : 64 2025-05-07T19:43:01.0765102Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0765431Z power management: 2025-05-07T19:43:01.0765583Z 2025-05-07T19:43:01.0765674Z processor : 8 2025-05-07T19:43:01.0765911Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0766153Z cpu family : 6 2025-05-07T19:43:01.0766382Z model : 85 2025-05-07T19:43:01.0766660Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0767035Z stepping : 7 2025-05-07T19:43:01.0767253Z microcode : 0x5003901 2025-05-07T19:43:01.0767500Z cpu MHz : 2999.994 2025-05-07T19:43:01.0767719Z cache size : 36608 KB 2025-05-07T19:43:01.0767963Z physical id : 0 2025-05-07T19:43:01.0768172Z siblings : 48 2025-05-07T19:43:01.0768390Z core id : 8 2025-05-07T19:43:01.0768590Z cpu cores : 24 2025-05-07T19:43:01.0768806Z apicid : 16 2025-05-07T19:43:01.0769022Z initial apicid : 16 2025-05-07T19:43:01.0769233Z fpu : yes 2025-05-07T19:43:01.0769445Z fpu_exception : yes 2025-05-07T19:43:01.0769660Z cpuid level : 13 2025-05-07T19:43:01.0769883Z wp : yes 2025-05-07T19:43:01.0772129Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0774806Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0775408Z bogomips : 5999.98 2025-05-07T19:43:01.0775810Z clflush size : 64 2025-05-07T19:43:01.0776095Z cache_alignment : 64 2025-05-07T19:43:01.0776374Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0776724Z power management: 2025-05-07T19:43:01.0776859Z 2025-05-07T19:43:01.0776963Z processor : 9 2025-05-07T19:43:01.0777181Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0777606Z cpu family : 6 2025-05-07T19:43:01.0777814Z model : 85 2025-05-07T19:43:01.0778109Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0778465Z stepping : 7 2025-05-07T19:43:01.0778694Z microcode : 0x5003901 2025-05-07T19:43:01.0779017Z cpu MHz : 3229.445 2025-05-07T19:43:01.0779257Z cache size : 36608 KB 2025-05-07T19:43:01.0779488Z physical id : 0 2025-05-07T19:43:01.0779726Z siblings : 48 2025-05-07T19:43:01.0779936Z core id : 9 2025-05-07T19:43:01.0780153Z cpu cores : 24 2025-05-07T19:43:01.0780377Z apicid : 18 2025-05-07T19:43:01.0780581Z initial apicid : 18 2025-05-07T19:43:01.0780810Z fpu : yes 2025-05-07T19:43:01.0781013Z fpu_exception : yes 2025-05-07T19:43:01.0781246Z cpuid level : 13 2025-05-07T19:43:01.0781456Z wp : yes 2025-05-07T19:43:01.0783713Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0786330Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0786909Z bogomips : 5999.98 2025-05-07T19:43:01.0787144Z clflush size : 64 2025-05-07T19:43:01.0787365Z cache_alignment : 64 2025-05-07T19:43:01.0787782Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0788104Z power management: 2025-05-07T19:43:01.0788249Z 2025-05-07T19:43:01.0788334Z processor : 10 2025-05-07T19:43:01.0788564Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0788800Z cpu family : 6 2025-05-07T19:43:01.0789018Z model : 85 2025-05-07T19:43:01.0789293Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0789676Z stepping : 7 2025-05-07T19:43:01.0790054Z microcode : 0x5003901 2025-05-07T19:43:01.0790294Z cpu MHz : 2999.994 2025-05-07T19:43:01.0790512Z cache size : 36608 KB 2025-05-07T19:43:01.0790754Z physical id : 0 2025-05-07T19:43:01.0790963Z siblings : 48 2025-05-07T19:43:01.0791189Z core id : 10 2025-05-07T19:43:01.0791391Z cpu cores : 24 2025-05-07T19:43:01.0791610Z apicid : 20 2025-05-07T19:43:01.0791826Z initial apicid : 20 2025-05-07T19:43:01.0792036Z fpu : yes 2025-05-07T19:43:01.0792247Z fpu_exception : yes 2025-05-07T19:43:01.0792467Z cpuid level : 13 2025-05-07T19:43:01.0792688Z wp : yes 2025-05-07T19:43:01.0794934Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0797639Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0798239Z bogomips : 5999.98 2025-05-07T19:43:01.0798455Z clflush size : 64 2025-05-07T19:43:01.0798690Z cache_alignment : 64 2025-05-07T19:43:01.0798965Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0799303Z power management: 2025-05-07T19:43:01.0799438Z 2025-05-07T19:43:01.0799542Z processor : 11 2025-05-07T19:43:01.0799763Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0800015Z cpu family : 6 2025-05-07T19:43:01.0800219Z model : 85 2025-05-07T19:43:01.0800505Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0800966Z stepping : 7 2025-05-07T19:43:01.0801196Z microcode : 0x5003901 2025-05-07T19:43:01.0801425Z cpu MHz : 2999.994 2025-05-07T19:43:01.0801660Z cache size : 36608 KB 2025-05-07T19:43:01.0801947Z physical id : 0 2025-05-07T19:43:01.0802173Z siblings : 48 2025-05-07T19:43:01.0802376Z core id : 11 2025-05-07T19:43:01.0802598Z cpu cores : 24 2025-05-07T19:43:01.0802820Z apicid : 22 2025-05-07T19:43:01.0803029Z initial apicid : 22 2025-05-07T19:43:01.0803258Z fpu : yes 2025-05-07T19:43:01.0803459Z fpu_exception : yes 2025-05-07T19:43:01.0803691Z cpuid level : 13 2025-05-07T19:43:01.0803897Z wp : yes 2025-05-07T19:43:01.0806155Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0808768Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0809346Z bogomips : 5999.98 2025-05-07T19:43:01.0809580Z clflush size : 64 2025-05-07T19:43:01.0809799Z cache_alignment : 64 2025-05-07T19:43:01.0810092Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0810420Z power management: 2025-05-07T19:43:01.0810572Z 2025-05-07T19:43:01.0810657Z processor : 12 2025-05-07T19:43:01.0810891Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0811130Z cpu family : 6 2025-05-07T19:43:01.0811348Z model : 85 2025-05-07T19:43:01.0811621Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0811990Z stepping : 7 2025-05-07T19:43:01.0812196Z microcode : 0x5003901 2025-05-07T19:43:01.0812428Z cpu MHz : 3210.666 2025-05-07T19:43:01.0812644Z cache size : 36608 KB 2025-05-07T19:43:01.0812881Z physical id : 0 2025-05-07T19:43:01.0813092Z siblings : 48 2025-05-07T19:43:01.0813307Z core id : 12 2025-05-07T19:43:01.0813522Z cpu cores : 24 2025-05-07T19:43:01.0813724Z apicid : 24 2025-05-07T19:43:01.0813947Z initial apicid : 24 2025-05-07T19:43:01.0814162Z fpu : yes 2025-05-07T19:43:01.0814376Z fpu_exception : yes 2025-05-07T19:43:01.0814593Z cpuid level : 13 2025-05-07T19:43:01.0814815Z wp : yes 2025-05-07T19:43:01.0817053Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0819669Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0820252Z bogomips : 5999.98 2025-05-07T19:43:01.0820467Z clflush size : 64 2025-05-07T19:43:01.0820702Z cache_alignment : 64 2025-05-07T19:43:01.0820975Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0821311Z power management: 2025-05-07T19:43:01.0821442Z 2025-05-07T19:43:01.0821546Z processor : 13 2025-05-07T19:43:01.0821764Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0822018Z cpu family : 6 2025-05-07T19:43:01.0822218Z model : 85 2025-05-07T19:43:01.0822503Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0822854Z stepping : 7 2025-05-07T19:43:01.0823147Z microcode : 0x5003901 2025-05-07T19:43:01.0823365Z cpu MHz : 2999.994 2025-05-07T19:43:01.0823588Z cache size : 36608 KB 2025-05-07T19:43:01.0823837Z physical id : 0 2025-05-07T19:43:01.0824046Z siblings : 48 2025-05-07T19:43:01.0824320Z core id : 13 2025-05-07T19:43:01.0824538Z cpu cores : 24 2025-05-07T19:43:01.0824739Z apicid : 26 2025-05-07T19:43:01.0824955Z initial apicid : 26 2025-05-07T19:43:01.0825166Z fpu : yes 2025-05-07T19:43:01.0825382Z fpu_exception : yes 2025-05-07T19:43:01.0825604Z cpuid level : 13 2025-05-07T19:43:01.0825830Z wp : yes 2025-05-07T19:43:01.0828064Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0830680Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0831358Z bogomips : 5999.98 2025-05-07T19:43:01.0831574Z clflush size : 64 2025-05-07T19:43:01.0831805Z cache_alignment : 64 2025-05-07T19:43:01.0832075Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0832411Z power management: 2025-05-07T19:43:01.0832544Z 2025-05-07T19:43:01.0832645Z processor : 14 2025-05-07T19:43:01.0832859Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0833165Z cpu family : 6 2025-05-07T19:43:01.0833367Z model : 85 2025-05-07T19:43:01.0833652Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0833998Z stepping : 7 2025-05-07T19:43:01.0834217Z microcode : 0x5003901 2025-05-07T19:43:01.0834519Z cpu MHz : 2999.994 2025-05-07T19:43:01.0834749Z cache size : 36608 KB 2025-05-07T19:43:01.0834977Z physical id : 0 2025-05-07T19:43:01.0835198Z siblings : 48 2025-05-07T19:43:01.0835412Z core id : 14 2025-05-07T19:43:01.0835614Z cpu cores : 24 2025-05-07T19:43:01.0835914Z apicid : 28 2025-05-07T19:43:01.0836132Z initial apicid : 28 2025-05-07T19:43:01.0836360Z fpu : yes 2025-05-07T19:43:01.0836558Z fpu_exception : yes 2025-05-07T19:43:01.0836791Z cpuid level : 13 2025-05-07T19:43:01.0836995Z wp : yes 2025-05-07T19:43:01.0839288Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0841901Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0842478Z bogomips : 5999.98 2025-05-07T19:43:01.0842697Z clflush size : 64 2025-05-07T19:43:01.0842914Z cache_alignment : 64 2025-05-07T19:43:01.0843198Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0843535Z power management: 2025-05-07T19:43:01.0843728Z 2025-05-07T19:43:01.0843814Z processor : 15 2025-05-07T19:43:01.0844051Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0844296Z cpu family : 6 2025-05-07T19:43:01.0844518Z model : 85 2025-05-07T19:43:01.0844795Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0845160Z stepping : 7 2025-05-07T19:43:01.0845370Z microcode : 0x5003901 2025-05-07T19:43:01.0845609Z cpu MHz : 3277.690 2025-05-07T19:43:01.0845978Z cache size : 36608 KB 2025-05-07T19:43:01.0846218Z physical id : 0 2025-05-07T19:43:01.0846647Z siblings : 48 2025-05-07T19:43:01.0846851Z core id : 15 2025-05-07T19:43:01.0847068Z cpu cores : 24 2025-05-07T19:43:01.0847273Z apicid : 30 2025-05-07T19:43:01.0847606Z initial apicid : 30 2025-05-07T19:43:01.0847826Z fpu : yes 2025-05-07T19:43:01.0848045Z fpu_exception : yes 2025-05-07T19:43:01.0848265Z cpuid level : 13 2025-05-07T19:43:01.0848490Z wp : yes 2025-05-07T19:43:01.0850729Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0853345Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0853940Z bogomips : 5999.98 2025-05-07T19:43:01.0854157Z clflush size : 64 2025-05-07T19:43:01.0854395Z cache_alignment : 64 2025-05-07T19:43:01.0854679Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0855003Z power management: 2025-05-07T19:43:01.0855138Z 2025-05-07T19:43:01.0855241Z processor : 16 2025-05-07T19:43:01.0855461Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0855715Z cpu family : 6 2025-05-07T19:43:01.0855918Z model : 85 2025-05-07T19:43:01.0856202Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0856553Z stepping : 7 2025-05-07T19:43:01.0856775Z microcode : 0x5003901 2025-05-07T19:43:01.0857002Z cpu MHz : 3188.769 2025-05-07T19:43:01.0857231Z cache size : 36608 KB 2025-05-07T19:43:01.0857463Z physical id : 0 2025-05-07T19:43:01.0857689Z siblings : 48 2025-05-07T19:43:01.0857908Z core id : 16 2025-05-07T19:43:01.0858111Z cpu cores : 24 2025-05-07T19:43:01.0858328Z apicid : 32 2025-05-07T19:43:01.0858528Z initial apicid : 32 2025-05-07T19:43:01.0858754Z fpu : yes 2025-05-07T19:43:01.0858951Z fpu_exception : yes 2025-05-07T19:43:01.0859179Z cpuid level : 13 2025-05-07T19:43:01.0859389Z wp : yes 2025-05-07T19:43:01.0862076Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0864722Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0865319Z bogomips : 5999.98 2025-05-07T19:43:01.0865541Z clflush size : 64 2025-05-07T19:43:01.0865771Z cache_alignment : 64 2025-05-07T19:43:01.0866044Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0866381Z power management: 2025-05-07T19:43:01.0866519Z 2025-05-07T19:43:01.0866605Z processor : 17 2025-05-07T19:43:01.0866836Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0867086Z cpu family : 6 2025-05-07T19:43:01.0867286Z model : 85 2025-05-07T19:43:01.0867569Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0867916Z stepping : 7 2025-05-07T19:43:01.0868131Z microcode : 0x5003901 2025-05-07T19:43:01.0868357Z cpu MHz : 3236.084 2025-05-07T19:43:01.0868584Z cache size : 36608 KB 2025-05-07T19:43:01.0868808Z physical id : 0 2025-05-07T19:43:01.0869137Z siblings : 48 2025-05-07T19:43:01.0869340Z core id : 17 2025-05-07T19:43:01.0869560Z cpu cores : 24 2025-05-07T19:43:01.0869766Z apicid : 34 2025-05-07T19:43:01.0869988Z initial apicid : 34 2025-05-07T19:43:01.0870219Z fpu : yes 2025-05-07T19:43:01.0870493Z fpu_exception : yes 2025-05-07T19:43:01.0870725Z cpuid level : 13 2025-05-07T19:43:01.0870931Z wp : yes 2025-05-07T19:43:01.0873247Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0875787Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0876596Z bogomips : 5999.98 2025-05-07T19:43:01.0876829Z clflush size : 64 2025-05-07T19:43:01.0877084Z cache_alignment : 64 2025-05-07T19:43:01.0877369Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0877692Z power management: 2025-05-07T19:43:01.0877843Z 2025-05-07T19:43:01.0877930Z processor : 18 2025-05-07T19:43:01.0878149Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0878399Z cpu family : 6 2025-05-07T19:43:01.0878617Z model : 85 2025-05-07T19:43:01.0878888Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0879249Z stepping : 7 2025-05-07T19:43:01.0879453Z microcode : 0x5003901 2025-05-07T19:43:01.0879691Z cpu MHz : 3235.160 2025-05-07T19:43:01.0879903Z cache size : 36608 KB 2025-05-07T19:43:01.0880140Z physical id : 0 2025-05-07T19:43:01.0880345Z siblings : 48 2025-05-07T19:43:01.0880555Z core id : 18 2025-05-07T19:43:01.0880759Z cpu cores : 24 2025-05-07T19:43:01.0880969Z apicid : 36 2025-05-07T19:43:01.0881170Z initial apicid : 36 2025-05-07T19:43:01.0881393Z fpu : yes 2025-05-07T19:43:01.0881602Z fpu_exception : yes 2025-05-07T19:43:01.0881823Z cpuid level : 13 2025-05-07T19:43:01.0882047Z wp : yes 2025-05-07T19:43:01.0884287Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0886894Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0887491Z bogomips : 5999.98 2025-05-07T19:43:01.0887708Z clflush size : 64 2025-05-07T19:43:01.0887940Z cache_alignment : 64 2025-05-07T19:43:01.0888216Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0888660Z power management: 2025-05-07T19:43:01.0888784Z 2025-05-07T19:43:01.0888867Z processor : 19 2025-05-07T19:43:01.0889080Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0889315Z cpu family : 6 2025-05-07T19:43:01.0889502Z model : 85 2025-05-07T19:43:01.0889767Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0890090Z stepping : 7 2025-05-07T19:43:01.0890293Z microcode : 0x5003901 2025-05-07T19:43:01.0890503Z cpu MHz : 3235.764 2025-05-07T19:43:01.0890714Z cache size : 36608 KB 2025-05-07T19:43:01.0890921Z physical id : 0 2025-05-07T19:43:01.0891125Z siblings : 48 2025-05-07T19:43:01.0891311Z core id : 19 2025-05-07T19:43:01.0891507Z cpu cores : 24 2025-05-07T19:43:01.0891762Z apicid : 38 2025-05-07T19:43:01.0891967Z initial apicid : 38 2025-05-07T19:43:01.0892177Z fpu : yes 2025-05-07T19:43:01.0892352Z fpu_exception : yes 2025-05-07T19:43:01.0892559Z cpuid level : 13 2025-05-07T19:43:01.0892745Z wp : yes 2025-05-07T19:43:01.0898487Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0901046Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0901583Z bogomips : 5999.98 2025-05-07T19:43:01.0901787Z clflush size : 64 2025-05-07T19:43:01.0901987Z cache_alignment : 64 2025-05-07T19:43:01.0902244Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0902546Z power management: 2025-05-07T19:43:01.0902678Z 2025-05-07T19:43:01.0902751Z processor : 20 2025-05-07T19:43:01.0902950Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0903164Z cpu family : 6 2025-05-07T19:43:01.0903351Z model : 85 2025-05-07T19:43:01.0903594Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0903923Z stepping : 7 2025-05-07T19:43:01.0904104Z microcode : 0x5003901 2025-05-07T19:43:01.0904317Z cpu MHz : 3191.891 2025-05-07T19:43:01.0904508Z cache size : 36608 KB 2025-05-07T19:43:01.0904718Z physical id : 0 2025-05-07T19:43:01.0904899Z siblings : 48 2025-05-07T19:43:01.0905093Z core id : 20 2025-05-07T19:43:01.0905273Z cpu cores : 24 2025-05-07T19:43:01.0905467Z apicid : 40 2025-05-07T19:43:01.0905648Z initial apicid : 40 2025-05-07T19:43:01.0905845Z fpu : yes 2025-05-07T19:43:01.0906030Z fpu_exception : yes 2025-05-07T19:43:01.0906223Z cpuid level : 13 2025-05-07T19:43:01.0906415Z wp : yes 2025-05-07T19:43:01.0908481Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0910877Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0911415Z bogomips : 5999.98 2025-05-07T19:43:01.0911607Z clflush size : 64 2025-05-07T19:43:01.0911805Z cache_alignment : 64 2025-05-07T19:43:01.0912064Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0912363Z power management: 2025-05-07T19:43:01.0912503Z 2025-05-07T19:43:01.0912583Z processor : 21 2025-05-07T19:43:01.0912782Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0913025Z cpu family : 6 2025-05-07T19:43:01.0913227Z model : 85 2025-05-07T19:43:01.0913488Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0913823Z stepping : 7 2025-05-07T19:43:01.0914019Z microcode : 0x5003901 2025-05-07T19:43:01.0914245Z cpu MHz : 2999.994 2025-05-07T19:43:01.0914446Z cache size : 36608 KB 2025-05-07T19:43:01.0914672Z physical id : 0 2025-05-07T19:43:01.0914870Z siblings : 48 2025-05-07T19:43:01.0915062Z core id : 21 2025-05-07T19:43:01.0915249Z cpu cores : 24 2025-05-07T19:43:01.0915457Z apicid : 42 2025-05-07T19:43:01.0915640Z initial apicid : 42 2025-05-07T19:43:01.0916022Z fpu : yes 2025-05-07T19:43:01.0916232Z fpu_exception : yes 2025-05-07T19:43:01.0916616Z cpuid level : 13 2025-05-07T19:43:01.0916840Z wp : yes 2025-05-07T19:43:01.0919142Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0921759Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0922348Z bogomips : 5999.98 2025-05-07T19:43:01.0922565Z clflush size : 64 2025-05-07T19:43:01.0922793Z cache_alignment : 64 2025-05-07T19:43:01.0923061Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0923397Z power management: 2025-05-07T19:43:01.0923530Z 2025-05-07T19:43:01.0923618Z processor : 22 2025-05-07T19:43:01.0923852Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0924104Z cpu family : 6 2025-05-07T19:43:01.0924306Z model : 85 2025-05-07T19:43:01.0924603Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0924959Z stepping : 7 2025-05-07T19:43:01.0925176Z microcode : 0x5003901 2025-05-07T19:43:01.0925396Z cpu MHz : 2999.994 2025-05-07T19:43:01.0925628Z cache size : 36608 KB 2025-05-07T19:43:01.0925853Z physical id : 0 2025-05-07T19:43:01.0926068Z siblings : 48 2025-05-07T19:43:01.0926270Z core id : 22 2025-05-07T19:43:01.0926475Z cpu cores : 24 2025-05-07T19:43:01.0926680Z apicid : 44 2025-05-07T19:43:01.0926888Z initial apicid : 44 2025-05-07T19:43:01.0927108Z fpu : yes 2025-05-07T19:43:01.0927302Z fpu_exception : yes 2025-05-07T19:43:01.0927534Z cpuid level : 13 2025-05-07T19:43:01.0927728Z wp : yes 2025-05-07T19:43:01.0929976Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0932386Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0932917Z bogomips : 5999.98 2025-05-07T19:43:01.0933124Z clflush size : 64 2025-05-07T19:43:01.0933324Z cache_alignment : 64 2025-05-07T19:43:01.0933590Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0933888Z power management: 2025-05-07T19:43:01.0934020Z 2025-05-07T19:43:01.0934104Z processor : 23 2025-05-07T19:43:01.0934304Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0934546Z cpu family : 6 2025-05-07T19:43:01.0934746Z model : 85 2025-05-07T19:43:01.0934996Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0935326Z stepping : 7 2025-05-07T19:43:01.0935515Z microcode : 0x5003901 2025-05-07T19:43:01.0935725Z cpu MHz : 2999.994 2025-05-07T19:43:01.0935918Z cache size : 36608 KB 2025-05-07T19:43:01.0936136Z physical id : 0 2025-05-07T19:43:01.0936321Z siblings : 48 2025-05-07T19:43:01.0936515Z core id : 23 2025-05-07T19:43:01.0936695Z cpu cores : 24 2025-05-07T19:43:01.0936887Z apicid : 46 2025-05-07T19:43:01.0937071Z initial apicid : 46 2025-05-07T19:43:01.0937279Z fpu : yes 2025-05-07T19:43:01.0937473Z fpu_exception : yes 2025-05-07T19:43:01.0937669Z cpuid level : 13 2025-05-07T19:43:01.0937935Z wp : yes 2025-05-07T19:43:01.0940059Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0942458Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0943003Z bogomips : 5999.98 2025-05-07T19:43:01.0943201Z clflush size : 64 2025-05-07T19:43:01.0943408Z cache_alignment : 64 2025-05-07T19:43:01.0943646Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0943953Z power management: 2025-05-07T19:43:01.0944073Z 2025-05-07T19:43:01.0944148Z processor : 24 2025-05-07T19:43:01.0944351Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0944576Z cpu family : 6 2025-05-07T19:43:01.0944759Z model : 85 2025-05-07T19:43:01.0945008Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0945325Z stepping : 7 2025-05-07T19:43:01.0945516Z microcode : 0x5003901 2025-05-07T19:43:01.0945716Z cpu MHz : 2999.994 2025-05-07T19:43:01.0945917Z cache size : 36608 KB 2025-05-07T19:43:01.0946118Z physical id : 1 2025-05-07T19:43:01.0946312Z siblings : 48 2025-05-07T19:43:01.0946634Z core id : 0 2025-05-07T19:43:01.0946995Z cpu cores : 24 2025-05-07T19:43:01.0947183Z apicid : 64 2025-05-07T19:43:01.0947485Z initial apicid : 64 2025-05-07T19:43:01.0947696Z fpu : yes 2025-05-07T19:43:01.0947883Z fpu_exception : yes 2025-05-07T19:43:01.0948101Z cpuid level : 13 2025-05-07T19:43:01.0948296Z wp : yes 2025-05-07T19:43:01.0950544Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0953144Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0953716Z bogomips : 5999.98 2025-05-07T19:43:01.0953924Z clflush size : 64 2025-05-07T19:43:01.0954132Z cache_alignment : 64 2025-05-07T19:43:01.0954395Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0954713Z power management: 2025-05-07T19:43:01.0954847Z 2025-05-07T19:43:01.0954926Z processor : 25 2025-05-07T19:43:01.0955142Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0955373Z cpu family : 6 2025-05-07T19:43:01.0955575Z model : 85 2025-05-07T19:43:01.0955903Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0956257Z stepping : 7 2025-05-07T19:43:01.0956453Z microcode : 0x5003901 2025-05-07T19:43:01.0956677Z cpu MHz : 1200.483 2025-05-07T19:43:01.0956895Z cache size : 36608 KB 2025-05-07T19:43:01.0957137Z physical id : 1 2025-05-07T19:43:01.0957347Z siblings : 48 2025-05-07T19:43:01.0957571Z core id : 1 2025-05-07T19:43:01.0957768Z cpu cores : 24 2025-05-07T19:43:01.0957990Z apicid : 66 2025-05-07T19:43:01.0958199Z initial apicid : 66 2025-05-07T19:43:01.0958433Z fpu : yes 2025-05-07T19:43:01.0958650Z fpu_exception : yes 2025-05-07T19:43:01.0958871Z cpuid level : 13 2025-05-07T19:43:01.0959098Z wp : yes 2025-05-07T19:43:01.0961628Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0964328Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0964916Z bogomips : 5999.98 2025-05-07T19:43:01.0965129Z clflush size : 64 2025-05-07T19:43:01.0965361Z cache_alignment : 64 2025-05-07T19:43:01.0965627Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0965961Z power management: 2025-05-07T19:43:01.0966096Z 2025-05-07T19:43:01.0966180Z processor : 26 2025-05-07T19:43:01.0966406Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0966655Z cpu family : 6 2025-05-07T19:43:01.0966854Z model : 85 2025-05-07T19:43:01.0967138Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0967483Z stepping : 7 2025-05-07T19:43:01.0967701Z microcode : 0x5003901 2025-05-07T19:43:01.0968028Z cpu MHz : 2999.994 2025-05-07T19:43:01.0968243Z cache size : 36608 KB 2025-05-07T19:43:01.0968452Z physical id : 1 2025-05-07T19:43:01.0968663Z siblings : 48 2025-05-07T19:43:01.0968847Z core id : 2 2025-05-07T19:43:01.0969040Z cpu cores : 24 2025-05-07T19:43:01.0969227Z apicid : 68 2025-05-07T19:43:01.0969424Z initial apicid : 68 2025-05-07T19:43:01.0969634Z fpu : yes 2025-05-07T19:43:01.0969817Z fpu_exception : yes 2025-05-07T19:43:01.0970028Z cpuid level : 13 2025-05-07T19:43:01.0970220Z wp : yes 2025-05-07T19:43:01.0972306Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0974719Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0975253Z bogomips : 5999.98 2025-05-07T19:43:01.0975468Z clflush size : 64 2025-05-07T19:43:01.0975670Z cache_alignment : 64 2025-05-07T19:43:01.0975931Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0976228Z power management: 2025-05-07T19:43:01.0976363Z 2025-05-07T19:43:01.0976443Z processor : 27 2025-05-07T19:43:01.0976658Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0976879Z cpu family : 6 2025-05-07T19:43:01.0977080Z model : 85 2025-05-07T19:43:01.0977331Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0977670Z stepping : 7 2025-05-07T19:43:01.0977860Z microcode : 0x5003901 2025-05-07T19:43:01.0978083Z cpu MHz : 1199.889 2025-05-07T19:43:01.0978283Z cache size : 36608 KB 2025-05-07T19:43:01.0978503Z physical id : 1 2025-05-07T19:43:01.0978697Z siblings : 48 2025-05-07T19:43:01.0978899Z core id : 3 2025-05-07T19:43:01.0979080Z cpu cores : 24 2025-05-07T19:43:01.0979281Z apicid : 70 2025-05-07T19:43:01.0979468Z initial apicid : 70 2025-05-07T19:43:01.0979679Z fpu : yes 2025-05-07T19:43:01.0979875Z fpu_exception : yes 2025-05-07T19:43:01.0980073Z cpuid level : 13 2025-05-07T19:43:01.0980276Z wp : yes 2025-05-07T19:43:01.0982418Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0984885Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0985432Z bogomips : 5999.98 2025-05-07T19:43:01.0985632Z clflush size : 64 2025-05-07T19:43:01.0985847Z cache_alignment : 64 2025-05-07T19:43:01.0986098Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0986409Z power management: 2025-05-07T19:43:01.0986531Z 2025-05-07T19:43:01.0986611Z processor : 28 2025-05-07T19:43:01.0986826Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0987062Z cpu family : 6 2025-05-07T19:43:01.0987252Z model : 85 2025-05-07T19:43:01.0987520Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0987852Z stepping : 7 2025-05-07T19:43:01.0988065Z microcode : 0x5003901 2025-05-07T19:43:01.0988276Z cpu MHz : 1199.928 2025-05-07T19:43:01.0988491Z cache size : 36608 KB 2025-05-07T19:43:01.0988698Z physical id : 1 2025-05-07T19:43:01.0988905Z siblings : 48 2025-05-07T19:43:01.0989088Z core id : 4 2025-05-07T19:43:01.0989283Z cpu cores : 24 2025-05-07T19:43:01.0989470Z apicid : 72 2025-05-07T19:43:01.0989668Z initial apicid : 72 2025-05-07T19:43:01.0989875Z fpu : yes 2025-05-07T19:43:01.0990056Z fpu_exception : yes 2025-05-07T19:43:01.0990262Z cpuid level : 13 2025-05-07T19:43:01.0990442Z wp : yes 2025-05-07T19:43:01.0992514Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.0994910Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.0995439Z bogomips : 5999.98 2025-05-07T19:43:01.0995646Z clflush size : 64 2025-05-07T19:43:01.0995912Z cache_alignment : 64 2025-05-07T19:43:01.0996174Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.0996646Z power management: 2025-05-07T19:43:01.0996791Z 2025-05-07T19:43:01.0996873Z processor : 29 2025-05-07T19:43:01.0997090Z vendor_id : GenuineIntel 2025-05-07T19:43:01.0997405Z cpu family : 6 2025-05-07T19:43:01.0997614Z model : 85 2025-05-07T19:43:01.0997882Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.0998235Z stepping : 7 2025-05-07T19:43:01.0998433Z microcode : 0x5003901 2025-05-07T19:43:01.0998672Z cpu MHz : 1198.798 2025-05-07T19:43:01.0998881Z cache size : 36608 KB 2025-05-07T19:43:01.0999108Z physical id : 1 2025-05-07T19:43:01.0999312Z siblings : 48 2025-05-07T19:43:01.0999520Z core id : 5 2025-05-07T19:43:01.0999708Z cpu cores : 24 2025-05-07T19:43:01.0999920Z apicid : 74 2025-05-07T19:43:01.1000116Z initial apicid : 74 2025-05-07T19:43:01.1000330Z fpu : yes 2025-05-07T19:43:01.1000527Z fpu_exception : yes 2025-05-07T19:43:01.1000732Z cpuid level : 13 2025-05-07T19:43:01.1000939Z wp : yes 2025-05-07T19:43:01.1003277Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1005934Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1006513Z bogomips : 5999.98 2025-05-07T19:43:01.1006722Z clflush size : 64 2025-05-07T19:43:01.1006944Z cache_alignment : 64 2025-05-07T19:43:01.1007204Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1007531Z power management: 2025-05-07T19:43:01.1007661Z 2025-05-07T19:43:01.1007740Z processor : 30 2025-05-07T19:43:01.1007957Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1008192Z cpu family : 6 2025-05-07T19:43:01.1008385Z model : 85 2025-05-07T19:43:01.1008753Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1009073Z stepping : 7 2025-05-07T19:43:01.1009259Z microcode : 0x5003901 2025-05-07T19:43:01.1009460Z cpu MHz : 1200.770 2025-05-07T19:43:01.1009665Z cache size : 36608 KB 2025-05-07T19:43:01.1009866Z physical id : 1 2025-05-07T19:43:01.1010062Z siblings : 48 2025-05-07T19:43:01.1010241Z core id : 6 2025-05-07T19:43:01.1010425Z cpu cores : 24 2025-05-07T19:43:01.1010609Z apicid : 76 2025-05-07T19:43:01.1010794Z initial apicid : 76 2025-05-07T19:43:01.1010988Z fpu : yes 2025-05-07T19:43:01.1011163Z fpu_exception : yes 2025-05-07T19:43:01.1011362Z cpuid level : 13 2025-05-07T19:43:01.1011542Z wp : yes 2025-05-07T19:43:01.1013617Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1016086Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1016615Z bogomips : 5999.98 2025-05-07T19:43:01.1016822Z clflush size : 64 2025-05-07T19:43:01.1017017Z cache_alignment : 64 2025-05-07T19:43:01.1017273Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1017563Z power management: 2025-05-07T19:43:01.1017698Z 2025-05-07T19:43:01.1017776Z processor : 31 2025-05-07T19:43:01.1017977Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1018189Z cpu family : 6 2025-05-07T19:43:01.1018376Z model : 85 2025-05-07T19:43:01.1018614Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1018936Z stepping : 7 2025-05-07T19:43:01.1019122Z microcode : 0x5003901 2025-05-07T19:43:01.1019324Z cpu MHz : 2999.994 2025-05-07T19:43:01.1019512Z cache size : 36608 KB 2025-05-07T19:43:01.1019730Z physical id : 1 2025-05-07T19:43:01.1019911Z siblings : 48 2025-05-07T19:43:01.1020091Z core id : 7 2025-05-07T19:43:01.1020260Z cpu cores : 24 2025-05-07T19:43:01.1020443Z apicid : 78 2025-05-07T19:43:01.1020623Z initial apicid : 78 2025-05-07T19:43:01.1020815Z fpu : yes 2025-05-07T19:43:01.1020993Z fpu_exception : yes 2025-05-07T19:43:01.1021183Z cpuid level : 13 2025-05-07T19:43:01.1021371Z wp : yes 2025-05-07T19:43:01.1023487Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1025942Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1026475Z bogomips : 5999.98 2025-05-07T19:43:01.1026667Z clflush size : 64 2025-05-07T19:43:01.1026863Z cache_alignment : 64 2025-05-07T19:43:01.1027104Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1027404Z power management: 2025-05-07T19:43:01.1027525Z 2025-05-07T19:43:01.1027601Z processor : 32 2025-05-07T19:43:01.1027800Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1028015Z cpu family : 6 2025-05-07T19:43:01.1028192Z model : 85 2025-05-07T19:43:01.1028440Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1028754Z stepping : 7 2025-05-07T19:43:01.1028948Z microcode : 0x5003901 2025-05-07T19:43:01.1029148Z cpu MHz : 1200.020 2025-05-07T19:43:01.1029360Z cache size : 36608 KB 2025-05-07T19:43:01.1029566Z physical id : 1 2025-05-07T19:43:01.1029777Z siblings : 48 2025-05-07T19:43:01.1029967Z core id : 8 2025-05-07T19:43:01.1030161Z cpu cores : 24 2025-05-07T19:43:01.1030345Z apicid : 80 2025-05-07T19:43:01.1030533Z initial apicid : 80 2025-05-07T19:43:01.1030756Z fpu : yes 2025-05-07T19:43:01.1030941Z fpu_exception : yes 2025-05-07T19:43:01.1031162Z cpuid level : 13 2025-05-07T19:43:01.1031348Z wp : yes 2025-05-07T19:43:01.1033430Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1035912Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1036623Z bogomips : 5999.98 2025-05-07T19:43:01.1036854Z clflush size : 64 2025-05-07T19:43:01.1037076Z cache_alignment : 64 2025-05-07T19:43:01.1037372Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1037706Z power management: 2025-05-07T19:43:01.1037847Z 2025-05-07T19:43:01.1037934Z processor : 33 2025-05-07T19:43:01.1038159Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1038398Z cpu family : 6 2025-05-07T19:43:01.1038623Z model : 85 2025-05-07T19:43:01.1038896Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1039272Z stepping : 7 2025-05-07T19:43:01.1039472Z microcode : 0x5003901 2025-05-07T19:43:01.1039710Z cpu MHz : 2999.994 2025-05-07T19:43:01.1039921Z cache size : 36608 KB 2025-05-07T19:43:01.1040167Z physical id : 1 2025-05-07T19:43:01.1040377Z siblings : 48 2025-05-07T19:43:01.1040586Z core id : 9 2025-05-07T19:43:01.1040784Z cpu cores : 24 2025-05-07T19:43:01.1040998Z apicid : 82 2025-05-07T19:43:01.1041207Z initial apicid : 82 2025-05-07T19:43:01.1041429Z fpu : yes 2025-05-07T19:43:01.1041646Z fpu_exception : yes 2025-05-07T19:43:01.1041858Z cpuid level : 13 2025-05-07T19:43:01.1042083Z wp : yes 2025-05-07T19:43:01.1044326Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1047227Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1047814Z bogomips : 5999.98 2025-05-07T19:43:01.1048022Z clflush size : 64 2025-05-07T19:43:01.1048237Z cache_alignment : 64 2025-05-07T19:43:01.1048492Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1048808Z power management: 2025-05-07T19:43:01.1048935Z 2025-05-07T19:43:01.1049017Z processor : 34 2025-05-07T19:43:01.1049230Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1049465Z cpu family : 6 2025-05-07T19:43:01.1049660Z model : 85 2025-05-07T19:43:01.1049927Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1050267Z stepping : 7 2025-05-07T19:43:01.1050470Z microcode : 0x5003901 2025-05-07T19:43:01.1050687Z cpu MHz : 2999.994 2025-05-07T19:43:01.1050902Z cache size : 36608 KB 2025-05-07T19:43:01.1051117Z physical id : 1 2025-05-07T19:43:01.1051328Z siblings : 48 2025-05-07T19:43:01.1051521Z core id : 10 2025-05-07T19:43:01.1051722Z cpu cores : 24 2025-05-07T19:43:01.1051921Z apicid : 84 2025-05-07T19:43:01.1052122Z initial apicid : 84 2025-05-07T19:43:01.1052338Z fpu : yes 2025-05-07T19:43:01.1052526Z fpu_exception : yes 2025-05-07T19:43:01.1052745Z cpuid level : 13 2025-05-07T19:43:01.1052822Z wp : yes 2025-05-07T19:43:01.1054954Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1055350Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1055436Z bogomips : 5999.98 2025-05-07T19:43:01.1055516Z clflush size : 64 2025-05-07T19:43:01.1055611Z cache_alignment : 64 2025-05-07T19:43:01.1055739Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1055823Z power management: 2025-05-07T19:43:01.1055827Z 2025-05-07T19:43:01.1055921Z processor : 35 2025-05-07T19:43:01.1056013Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1056091Z cpu family : 6 2025-05-07T19:43:01.1056174Z model : 85 2025-05-07T19:43:01.1056345Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1056426Z stepping : 7 2025-05-07T19:43:01.1056512Z microcode : 0x5003901 2025-05-07T19:43:01.1056597Z cpu MHz : 2999.994 2025-05-07T19:43:01.1056693Z cache size : 36608 KB 2025-05-07T19:43:01.1056779Z physical id : 1 2025-05-07T19:43:01.1056859Z siblings : 48 2025-05-07T19:43:01.1056962Z core id : 11 2025-05-07T19:43:01.1057047Z cpu cores : 24 2025-05-07T19:43:01.1057131Z apicid : 86 2025-05-07T19:43:01.1057225Z initial apicid : 86 2025-05-07T19:43:01.1057320Z fpu : yes 2025-05-07T19:43:01.1057411Z fpu_exception : yes 2025-05-07T19:43:01.1057497Z cpuid level : 13 2025-05-07T19:43:01.1057596Z wp : yes 2025-05-07T19:43:01.1059737Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1060184Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1060278Z bogomips : 5999.98 2025-05-07T19:43:01.1060428Z clflush size : 64 2025-05-07T19:43:01.1060511Z cache_alignment : 64 2025-05-07T19:43:01.1060655Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1060740Z power management: 2025-05-07T19:43:01.1060744Z 2025-05-07T19:43:01.1060965Z processor : 36 2025-05-07T19:43:01.1061053Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1061143Z cpu family : 6 2025-05-07T19:43:01.1061224Z model : 85 2025-05-07T19:43:01.1061369Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1061462Z stepping : 7 2025-05-07T19:43:01.1061553Z microcode : 0x5003901 2025-05-07T19:43:01.1061632Z cpu MHz : 1199.409 2025-05-07T19:43:01.1061713Z cache size : 36608 KB 2025-05-07T19:43:01.1061812Z physical id : 1 2025-05-07T19:43:01.1061896Z siblings : 48 2025-05-07T19:43:01.1061970Z core id : 12 2025-05-07T19:43:01.1062067Z cpu cores : 24 2025-05-07T19:43:01.1062144Z apicid : 88 2025-05-07T19:43:01.1062232Z initial apicid : 88 2025-05-07T19:43:01.1062308Z fpu : yes 2025-05-07T19:43:01.1062411Z fpu_exception : yes 2025-05-07T19:43:01.1062490Z cpuid level : 13 2025-05-07T19:43:01.1062573Z wp : yes 2025-05-07T19:43:01.1064548Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1064911Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1064996Z bogomips : 5999.98 2025-05-07T19:43:01.1065092Z clflush size : 64 2025-05-07T19:43:01.1065177Z cache_alignment : 64 2025-05-07T19:43:01.1065305Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1065403Z power management: 2025-05-07T19:43:01.1065407Z 2025-05-07T19:43:01.1065483Z processor : 37 2025-05-07T19:43:01.1065573Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1065659Z cpu family : 6 2025-05-07T19:43:01.1065747Z model : 85 2025-05-07T19:43:01.1065896Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1065981Z stepping : 7 2025-05-07T19:43:01.1066075Z microcode : 0x5003901 2025-05-07T19:43:01.1066154Z cpu MHz : 1199.715 2025-05-07T19:43:01.1066239Z cache size : 36608 KB 2025-05-07T19:43:01.1066322Z physical id : 1 2025-05-07T19:43:01.1066412Z siblings : 48 2025-05-07T19:43:01.1066493Z core id : 13 2025-05-07T19:43:01.1066573Z cpu cores : 24 2025-05-07T19:43:01.1066660Z apicid : 90 2025-05-07T19:43:01.1066744Z initial apicid : 90 2025-05-07T19:43:01.1066821Z fpu : yes 2025-05-07T19:43:01.1066902Z fpu_exception : yes 2025-05-07T19:43:01.1066986Z cpuid level : 13 2025-05-07T19:43:01.1067067Z wp : yes 2025-05-07T19:43:01.1069033Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1069398Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1069537Z bogomips : 5999.98 2025-05-07T19:43:01.1069626Z clflush size : 64 2025-05-07T19:43:01.1069719Z cache_alignment : 64 2025-05-07T19:43:01.1069885Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1069971Z power management: 2025-05-07T19:43:01.1069975Z 2025-05-07T19:43:01.1070069Z processor : 38 2025-05-07T19:43:01.1070157Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1070236Z cpu family : 6 2025-05-07T19:43:01.1070312Z model : 85 2025-05-07T19:43:01.1070474Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1070552Z stepping : 7 2025-05-07T19:43:01.1070635Z microcode : 0x5003901 2025-05-07T19:43:01.1070727Z cpu MHz : 1199.721 2025-05-07T19:43:01.1070808Z cache size : 36608 KB 2025-05-07T19:43:01.1070888Z physical id : 1 2025-05-07T19:43:01.1070967Z siblings : 48 2025-05-07T19:43:01.1071055Z core id : 14 2025-05-07T19:43:01.1071137Z cpu cores : 24 2025-05-07T19:43:01.1071216Z apicid : 92 2025-05-07T19:43:01.1071316Z initial apicid : 92 2025-05-07T19:43:01.1071397Z fpu : yes 2025-05-07T19:43:01.1071483Z fpu_exception : yes 2025-05-07T19:43:01.1071564Z cpuid level : 13 2025-05-07T19:43:01.1071659Z wp : yes 2025-05-07T19:43:01.1073624Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1073996Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1074083Z bogomips : 5999.98 2025-05-07T19:43:01.1074164Z clflush size : 64 2025-05-07T19:43:01.1074248Z cache_alignment : 64 2025-05-07T19:43:01.1074393Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1074476Z power management: 2025-05-07T19:43:01.1074480Z 2025-05-07T19:43:01.1074562Z processor : 39 2025-05-07T19:43:01.1074663Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1074744Z cpu family : 6 2025-05-07T19:43:01.1074822Z model : 85 2025-05-07T19:43:01.1074974Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1075067Z stepping : 7 2025-05-07T19:43:01.1075151Z microcode : 0x5003901 2025-05-07T19:43:01.1075231Z cpu MHz : 2999.994 2025-05-07T19:43:01.1075330Z cache size : 36608 KB 2025-05-07T19:43:01.1075411Z physical id : 1 2025-05-07T19:43:01.1075490Z siblings : 48 2025-05-07T19:43:01.1075568Z core id : 15 2025-05-07T19:43:01.1075661Z cpu cores : 24 2025-05-07T19:43:01.1075744Z apicid : 94 2025-05-07T19:43:01.1075888Z initial apicid : 94 2025-05-07T19:43:01.1075986Z fpu : yes 2025-05-07T19:43:01.1076071Z fpu_exception : yes 2025-05-07T19:43:01.1076152Z cpuid level : 13 2025-05-07T19:43:01.1076229Z wp : yes 2025-05-07T19:43:01.1078546Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1078929Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1079081Z bogomips : 5999.98 2025-05-07T19:43:01.1079167Z clflush size : 64 2025-05-07T19:43:01.1079254Z cache_alignment : 64 2025-05-07T19:43:01.1079384Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1079528Z power management: 2025-05-07T19:43:01.1079533Z 2025-05-07T19:43:01.1079618Z processor : 40 2025-05-07T19:43:01.1079708Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1079804Z cpu family : 6 2025-05-07T19:43:01.1079884Z model : 85 2025-05-07T19:43:01.1080042Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1080125Z stepping : 7 2025-05-07T19:43:01.1080225Z microcode : 0x5003901 2025-05-07T19:43:01.1080306Z cpu MHz : 2999.994 2025-05-07T19:43:01.1080391Z cache size : 36608 KB 2025-05-07T19:43:01.1080487Z physical id : 1 2025-05-07T19:43:01.1080567Z siblings : 48 2025-05-07T19:43:01.1080647Z core id : 16 2025-05-07T19:43:01.1080728Z cpu cores : 24 2025-05-07T19:43:01.1080824Z apicid : 96 2025-05-07T19:43:01.1080912Z initial apicid : 96 2025-05-07T19:43:01.1080996Z fpu : yes 2025-05-07T19:43:01.1081084Z fpu_exception : yes 2025-05-07T19:43:01.1081181Z cpuid level : 13 2025-05-07T19:43:01.1081262Z wp : yes 2025-05-07T19:43:01.1083395Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1083796Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1083883Z bogomips : 5999.98 2025-05-07T19:43:01.1083971Z clflush size : 64 2025-05-07T19:43:01.1084073Z cache_alignment : 64 2025-05-07T19:43:01.1084205Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1084293Z power management: 2025-05-07T19:43:01.1084297Z 2025-05-07T19:43:01.1084396Z processor : 41 2025-05-07T19:43:01.1084489Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1084571Z cpu family : 6 2025-05-07T19:43:01.1084665Z model : 85 2025-05-07T19:43:01.1084825Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1084907Z stepping : 7 2025-05-07T19:43:01.1084994Z microcode : 0x5003901 2025-05-07T19:43:01.1085091Z cpu MHz : 2999.994 2025-05-07T19:43:01.1085176Z cache size : 36608 KB 2025-05-07T19:43:01.1085261Z physical id : 1 2025-05-07T19:43:01.1085343Z siblings : 48 2025-05-07T19:43:01.1085439Z core id : 17 2025-05-07T19:43:01.1085522Z cpu cores : 24 2025-05-07T19:43:01.1085605Z apicid : 98 2025-05-07T19:43:01.1085703Z initial apicid : 98 2025-05-07T19:43:01.1085784Z fpu : yes 2025-05-07T19:43:01.1085878Z fpu_exception : yes 2025-05-07T19:43:01.1085964Z cpuid level : 13 2025-05-07T19:43:01.1086057Z wp : yes 2025-05-07T19:43:01.1088188Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1088688Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1088767Z bogomips : 5999.98 2025-05-07T19:43:01.1088848Z clflush size : 64 2025-05-07T19:43:01.1088991Z cache_alignment : 64 2025-05-07T19:43:01.1089126Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1089208Z power management: 2025-05-07T19:43:01.1089212Z 2025-05-07T19:43:01.1089292Z processor : 42 2025-05-07T19:43:01.1089439Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1089519Z cpu family : 6 2025-05-07T19:43:01.1089598Z model : 85 2025-05-07T19:43:01.1089749Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1089843Z stepping : 7 2025-05-07T19:43:01.1089925Z microcode : 0x5003901 2025-05-07T19:43:01.1090004Z cpu MHz : 2999.994 2025-05-07T19:43:01.1090096Z cache size : 36608 KB 2025-05-07T19:43:01.1090174Z physical id : 1 2025-05-07T19:43:01.1090250Z siblings : 48 2025-05-07T19:43:01.1090327Z core id : 18 2025-05-07T19:43:01.1090419Z cpu cores : 24 2025-05-07T19:43:01.1090496Z apicid : 100 2025-05-07T19:43:01.1090578Z initial apicid : 100 2025-05-07T19:43:01.1090665Z fpu : yes 2025-05-07T19:43:01.1090747Z fpu_exception : yes 2025-05-07T19:43:01.1090830Z cpuid level : 13 2025-05-07T19:43:01.1090905Z wp : yes 2025-05-07T19:43:01.1092884Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1093240Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1093333Z bogomips : 5999.98 2025-05-07T19:43:01.1093413Z clflush size : 64 2025-05-07T19:43:01.1093496Z cache_alignment : 64 2025-05-07T19:43:01.1093620Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1093714Z power management: 2025-05-07T19:43:01.1093718Z 2025-05-07T19:43:01.1093796Z processor : 43 2025-05-07T19:43:01.1093883Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1093976Z cpu family : 6 2025-05-07T19:43:01.1094052Z model : 85 2025-05-07T19:43:01.1094201Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1094279Z stepping : 7 2025-05-07T19:43:01.1094374Z microcode : 0x5003901 2025-05-07T19:43:01.1094453Z cpu MHz : 2999.994 2025-05-07T19:43:01.1094533Z cache size : 36608 KB 2025-05-07T19:43:01.1094625Z physical id : 1 2025-05-07T19:43:01.1094701Z siblings : 48 2025-05-07T19:43:01.1094777Z core id : 19 2025-05-07T19:43:01.1094853Z cpu cores : 24 2025-05-07T19:43:01.1094942Z apicid : 102 2025-05-07T19:43:01.1095023Z initial apicid : 102 2025-05-07T19:43:01.1095098Z fpu : yes 2025-05-07T19:43:01.1095191Z fpu_exception : yes 2025-05-07T19:43:01.1095269Z cpuid level : 13 2025-05-07T19:43:01.1095345Z wp : yes 2025-05-07T19:43:01.1097327Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1097683Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1097763Z bogomips : 5999.98 2025-05-07T19:43:01.1097855Z clflush size : 64 2025-05-07T19:43:01.1097936Z cache_alignment : 64 2025-05-07T19:43:01.1098058Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1098186Z power management: 2025-05-07T19:43:01.1098190Z 2025-05-07T19:43:01.1098282Z processor : 44 2025-05-07T19:43:01.1098368Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1098444Z cpu family : 6 2025-05-07T19:43:01.1098576Z model : 85 2025-05-07T19:43:01.1098726Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1098804Z stepping : 7 2025-05-07T19:43:01.1098885Z microcode : 0x5003901 2025-05-07T19:43:01.1098976Z cpu MHz : 2999.994 2025-05-07T19:43:01.1099055Z cache size : 36608 KB 2025-05-07T19:43:01.1099135Z physical id : 1 2025-05-07T19:43:01.1099225Z siblings : 48 2025-05-07T19:43:01.1099303Z core id : 20 2025-05-07T19:43:01.1099381Z cpu cores : 24 2025-05-07T19:43:01.1099460Z apicid : 104 2025-05-07T19:43:01.1099557Z initial apicid : 104 2025-05-07T19:43:01.1099636Z fpu : yes 2025-05-07T19:43:01.1099719Z fpu_exception : yes 2025-05-07T19:43:01.1099811Z cpuid level : 13 2025-05-07T19:43:01.1099886Z wp : yes 2025-05-07T19:43:01.1101862Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1102231Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1102310Z bogomips : 5999.98 2025-05-07T19:43:01.1102388Z clflush size : 64 2025-05-07T19:43:01.1102482Z cache_alignment : 64 2025-05-07T19:43:01.1102605Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1102689Z power management: 2025-05-07T19:43:01.1102693Z 2025-05-07T19:43:01.1102772Z processor : 45 2025-05-07T19:43:01.1102870Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1102948Z cpu family : 6 2025-05-07T19:43:01.1103024Z model : 85 2025-05-07T19:43:01.1103190Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1103268Z stepping : 7 2025-05-07T19:43:01.1103349Z microcode : 0x5003901 2025-05-07T19:43:01.1103426Z cpu MHz : 2999.994 2025-05-07T19:43:01.1103521Z cache size : 36608 KB 2025-05-07T19:43:01.1103603Z physical id : 1 2025-05-07T19:43:01.1103679Z siblings : 48 2025-05-07T19:43:01.1103768Z core id : 21 2025-05-07T19:43:01.1103846Z cpu cores : 24 2025-05-07T19:43:01.1103922Z apicid : 106 2025-05-07T19:43:01.1104004Z initial apicid : 106 2025-05-07T19:43:01.1104092Z fpu : yes 2025-05-07T19:43:01.1104175Z fpu_exception : yes 2025-05-07T19:43:01.1104255Z cpuid level : 13 2025-05-07T19:43:01.1104347Z wp : yes 2025-05-07T19:43:01.1106321Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1106680Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1106774Z bogomips : 5999.98 2025-05-07T19:43:01.1106855Z clflush size : 64 2025-05-07T19:43:01.1106938Z cache_alignment : 64 2025-05-07T19:43:01.1107077Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1107163Z power management: 2025-05-07T19:43:01.1107168Z 2025-05-07T19:43:01.1107836Z processor : 46 2025-05-07T19:43:01.1107925Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1108023Z cpu family : 6 2025-05-07T19:43:01.1108100Z model : 85 2025-05-07T19:43:01.1108302Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1108397Z stepping : 7 2025-05-07T19:43:01.1108480Z microcode : 0x5003901 2025-05-07T19:43:01.1108561Z cpu MHz : 1207.777 2025-05-07T19:43:01.1108642Z cache size : 36608 KB 2025-05-07T19:43:01.1108740Z physical id : 1 2025-05-07T19:43:01.1108819Z siblings : 48 2025-05-07T19:43:01.1108896Z core id : 22 2025-05-07T19:43:01.1108988Z cpu cores : 24 2025-05-07T19:43:01.1109065Z apicid : 108 2025-05-07T19:43:01.1109150Z initial apicid : 108 2025-05-07T19:43:01.1109227Z fpu : yes 2025-05-07T19:43:01.1109327Z fpu_exception : yes 2025-05-07T19:43:01.1109408Z cpuid level : 13 2025-05-07T19:43:01.1109482Z wp : yes 2025-05-07T19:43:01.1111464Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1111827Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1111908Z bogomips : 5999.98 2025-05-07T19:43:01.1112000Z clflush size : 64 2025-05-07T19:43:01.1112084Z cache_alignment : 64 2025-05-07T19:43:01.1112208Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1112310Z power management: 2025-05-07T19:43:01.1112314Z 2025-05-07T19:43:01.1112395Z processor : 47 2025-05-07T19:43:01.1112486Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1112565Z cpu family : 6 2025-05-07T19:43:01.1112660Z model : 85 2025-05-07T19:43:01.1112811Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1112894Z stepping : 7 2025-05-07T19:43:01.1112992Z microcode : 0x5003901 2025-05-07T19:43:01.1113072Z cpu MHz : 2999.994 2025-05-07T19:43:01.1113155Z cache size : 36608 KB 2025-05-07T19:43:01.1113233Z physical id : 1 2025-05-07T19:43:01.1113323Z siblings : 48 2025-05-07T19:43:01.1113398Z core id : 23 2025-05-07T19:43:01.1113476Z cpu cores : 24 2025-05-07T19:43:01.1113553Z apicid : 110 2025-05-07T19:43:01.1113648Z initial apicid : 110 2025-05-07T19:43:01.1113722Z fpu : yes 2025-05-07T19:43:01.1113804Z fpu_exception : yes 2025-05-07T19:43:01.1113894Z cpuid level : 13 2025-05-07T19:43:01.1113969Z wp : yes 2025-05-07T19:43:01.1116009Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1116558Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1116643Z bogomips : 5999.98 2025-05-07T19:43:01.1116728Z clflush size : 64 2025-05-07T19:43:01.1116827Z cache_alignment : 64 2025-05-07T19:43:01.1116961Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1117047Z power management: 2025-05-07T19:43:01.1117051Z 2025-05-07T19:43:01.1117236Z processor : 48 2025-05-07T19:43:01.1117329Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1117470Z cpu family : 6 2025-05-07T19:43:01.1117551Z model : 85 2025-05-07T19:43:01.1132107Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1132262Z stepping : 7 2025-05-07T19:43:01.1132481Z microcode : 0x5003901 2025-05-07T19:43:01.1132562Z cpu MHz : 3222.075 2025-05-07T19:43:01.1132653Z cache size : 36608 KB 2025-05-07T19:43:01.1132763Z physical id : 0 2025-05-07T19:43:01.1132834Z siblings : 48 2025-05-07T19:43:01.1132912Z core id : 0 2025-05-07T19:43:01.1132988Z cpu cores : 24 2025-05-07T19:43:01.1133059Z apicid : 1 2025-05-07T19:43:01.1133138Z initial apicid : 1 2025-05-07T19:43:01.1133224Z fpu : yes 2025-05-07T19:43:01.1133303Z fpu_exception : yes 2025-05-07T19:43:01.1133379Z cpuid level : 13 2025-05-07T19:43:01.1133453Z wp : yes 2025-05-07T19:43:01.1135447Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1135811Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1135899Z bogomips : 5999.98 2025-05-07T19:43:01.1135973Z clflush size : 64 2025-05-07T19:43:01.1136051Z cache_alignment : 64 2025-05-07T19:43:01.1136183Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1136260Z power management: 2025-05-07T19:43:01.1136266Z 2025-05-07T19:43:01.1136344Z processor : 49 2025-05-07T19:43:01.1136427Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1136510Z cpu family : 6 2025-05-07T19:43:01.1136585Z model : 85 2025-05-07T19:43:01.1136736Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1136821Z stepping : 7 2025-05-07T19:43:01.1136900Z microcode : 0x5003901 2025-05-07T19:43:01.1136974Z cpu MHz : 2999.994 2025-05-07T19:43:01.1137053Z cache size : 36608 KB 2025-05-07T19:43:01.1137137Z physical id : 0 2025-05-07T19:43:01.1137209Z siblings : 48 2025-05-07T19:43:01.1137279Z core id : 1 2025-05-07T19:43:01.1137366Z cpu cores : 24 2025-05-07T19:43:01.1137439Z apicid : 3 2025-05-07T19:43:01.1137515Z initial apicid : 3 2025-05-07T19:43:01.1137585Z fpu : yes 2025-05-07T19:43:01.1137675Z fpu_exception : yes 2025-05-07T19:43:01.1137750Z cpuid level : 13 2025-05-07T19:43:01.1137821Z wp : yes 2025-05-07T19:43:01.1139799Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1140157Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1140230Z bogomips : 5999.98 2025-05-07T19:43:01.1140315Z clflush size : 64 2025-05-07T19:43:01.1140393Z cache_alignment : 64 2025-05-07T19:43:01.1140515Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1140589Z power management: 2025-05-07T19:43:01.1140602Z 2025-05-07T19:43:01.1140676Z processor : 50 2025-05-07T19:43:01.1140760Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1140831Z cpu family : 6 2025-05-07T19:43:01.1140907Z model : 85 2025-05-07T19:43:01.1141110Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1141184Z stepping : 7 2025-05-07T19:43:01.1141272Z microcode : 0x5003901 2025-05-07T19:43:01.1141345Z cpu MHz : 2999.994 2025-05-07T19:43:01.1141422Z cache size : 36608 KB 2025-05-07T19:43:01.1141546Z physical id : 0 2025-05-07T19:43:01.1141630Z siblings : 48 2025-05-07T19:43:01.1141700Z core id : 2 2025-05-07T19:43:01.1141772Z cpu cores : 24 2025-05-07T19:43:01.1141845Z apicid : 5 2025-05-07T19:43:01.1141929Z initial apicid : 5 2025-05-07T19:43:01.1142003Z fpu : yes 2025-05-07T19:43:01.1142082Z fpu_exception : yes 2025-05-07T19:43:01.1142169Z cpuid level : 13 2025-05-07T19:43:01.1142243Z wp : yes 2025-05-07T19:43:01.1144212Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1144573Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1144650Z bogomips : 5999.98 2025-05-07T19:43:01.1144726Z clflush size : 64 2025-05-07T19:43:01.1144814Z cache_alignment : 64 2025-05-07T19:43:01.1144935Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1145015Z power management: 2025-05-07T19:43:01.1145020Z 2025-05-07T19:43:01.1145102Z processor : 51 2025-05-07T19:43:01.1145181Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1145250Z cpu family : 6 2025-05-07T19:43:01.1145319Z model : 85 2025-05-07T19:43:01.1145477Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1145555Z stepping : 7 2025-05-07T19:43:01.1145631Z microcode : 0x5003901 2025-05-07T19:43:01.1145707Z cpu MHz : 2999.994 2025-05-07T19:43:01.1145796Z cache size : 36608 KB 2025-05-07T19:43:01.1145870Z physical id : 0 2025-05-07T19:43:01.1145944Z siblings : 48 2025-05-07T19:43:01.1146024Z core id : 3 2025-05-07T19:43:01.1146098Z cpu cores : 24 2025-05-07T19:43:01.1146167Z apicid : 7 2025-05-07T19:43:01.1146242Z initial apicid : 7 2025-05-07T19:43:01.1146324Z fpu : yes 2025-05-07T19:43:01.1146547Z fpu_exception : yes 2025-05-07T19:43:01.1146621Z cpuid level : 13 2025-05-07T19:43:01.1146698Z wp : yes 2025-05-07T19:43:01.1148985Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1149373Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1149463Z bogomips : 5999.98 2025-05-07T19:43:01.1149546Z clflush size : 64 2025-05-07T19:43:01.1149628Z cache_alignment : 64 2025-05-07T19:43:01.1149763Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1149846Z power management: 2025-05-07T19:43:01.1149850Z 2025-05-07T19:43:01.1149929Z processor : 52 2025-05-07T19:43:01.1150019Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1150103Z cpu family : 6 2025-05-07T19:43:01.1150179Z model : 85 2025-05-07T19:43:01.1150339Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1150423Z stepping : 7 2025-05-07T19:43:01.1150606Z microcode : 0x5003901 2025-05-07T19:43:01.1150686Z cpu MHz : 2999.994 2025-05-07T19:43:01.1150767Z cache size : 36608 KB 2025-05-07T19:43:01.1150857Z physical id : 0 2025-05-07T19:43:01.1150935Z siblings : 48 2025-05-07T19:43:01.1151111Z core id : 4 2025-05-07T19:43:01.1151199Z cpu cores : 24 2025-05-07T19:43:01.1151276Z apicid : 9 2025-05-07T19:43:01.1151360Z initial apicid : 9 2025-05-07T19:43:01.1151438Z fpu : yes 2025-05-07T19:43:01.1151530Z fpu_exception : yes 2025-05-07T19:43:01.1151611Z cpuid level : 13 2025-05-07T19:43:01.1151688Z wp : yes 2025-05-07T19:43:01.1153821Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1154209Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1154294Z bogomips : 5999.98 2025-05-07T19:43:01.1154383Z clflush size : 64 2025-05-07T19:43:01.1154466Z cache_alignment : 64 2025-05-07T19:43:01.1154597Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1154691Z power management: 2025-05-07T19:43:01.1154695Z 2025-05-07T19:43:01.1154773Z processor : 53 2025-05-07T19:43:01.1154861Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1154941Z cpu family : 6 2025-05-07T19:43:01.1155026Z model : 85 2025-05-07T19:43:01.1155184Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1155266Z stepping : 7 2025-05-07T19:43:01.1155359Z microcode : 0x5003901 2025-05-07T19:43:01.1155440Z cpu MHz : 2999.994 2025-05-07T19:43:01.1155521Z cache size : 36608 KB 2025-05-07T19:43:01.1155602Z physical id : 0 2025-05-07T19:43:01.1155689Z siblings : 48 2025-05-07T19:43:01.1155765Z core id : 5 2025-05-07T19:43:01.1156083Z cpu cores : 24 2025-05-07T19:43:01.1156174Z apicid : 11 2025-05-07T19:43:01.1156256Z initial apicid : 11 2025-05-07T19:43:01.1156332Z fpu : yes 2025-05-07T19:43:01.1156416Z fpu_exception : yes 2025-05-07T19:43:01.1156501Z cpuid level : 13 2025-05-07T19:43:01.1156577Z wp : yes 2025-05-07T19:43:01.1158699Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1159095Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1159175Z bogomips : 5999.98 2025-05-07T19:43:01.1159255Z clflush size : 64 2025-05-07T19:43:01.1159346Z cache_alignment : 64 2025-05-07T19:43:01.1159474Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1159555Z power management: 2025-05-07T19:43:01.1159560Z 2025-05-07T19:43:01.1159647Z processor : 54 2025-05-07T19:43:01.1159734Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1159815Z cpu family : 6 2025-05-07T19:43:01.1159890Z model : 85 2025-05-07T19:43:01.1160055Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1160134Z stepping : 7 2025-05-07T19:43:01.1160216Z microcode : 0x5003901 2025-05-07T19:43:01.1160302Z cpu MHz : 3242.664 2025-05-07T19:43:01.1160439Z cache size : 36608 KB 2025-05-07T19:43:01.1160520Z physical id : 0 2025-05-07T19:43:01.1160596Z siblings : 48 2025-05-07T19:43:01.1160681Z core id : 6 2025-05-07T19:43:01.1160760Z cpu cores : 24 2025-05-07T19:43:01.1160838Z apicid : 13 2025-05-07T19:43:01.1160981Z initial apicid : 13 2025-05-07T19:43:01.1161071Z fpu : yes 2025-05-07T19:43:01.1161173Z fpu_exception : yes 2025-05-07T19:43:01.1161256Z cpuid level : 13 2025-05-07T19:43:01.1161341Z wp : yes 2025-05-07T19:43:01.1163465Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1163861Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1163947Z bogomips : 5999.98 2025-05-07T19:43:01.1164028Z clflush size : 64 2025-05-07T19:43:01.1164109Z cache_alignment : 64 2025-05-07T19:43:01.1164249Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1164335Z power management: 2025-05-07T19:43:01.1164339Z 2025-05-07T19:43:01.1164418Z processor : 55 2025-05-07T19:43:01.1164517Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1164596Z cpu family : 6 2025-05-07T19:43:01.1164672Z model : 85 2025-05-07T19:43:01.1164824Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1164909Z stepping : 7 2025-05-07T19:43:01.1164992Z microcode : 0x5003901 2025-05-07T19:43:01.1165070Z cpu MHz : 3214.630 2025-05-07T19:43:01.1165161Z cache size : 36608 KB 2025-05-07T19:43:01.1165244Z physical id : 0 2025-05-07T19:43:01.1165320Z siblings : 48 2025-05-07T19:43:01.1165396Z core id : 7 2025-05-07T19:43:01.1165485Z cpu cores : 24 2025-05-07T19:43:01.1165565Z apicid : 15 2025-05-07T19:43:01.1165648Z initial apicid : 15 2025-05-07T19:43:01.1165729Z fpu : yes 2025-05-07T19:43:01.1165821Z fpu_exception : yes 2025-05-07T19:43:01.1165901Z cpuid level : 13 2025-05-07T19:43:01.1165976Z wp : yes 2025-05-07T19:43:01.1168209Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1168562Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1168644Z bogomips : 5999.98 2025-05-07T19:43:01.1168721Z clflush size : 64 2025-05-07T19:43:01.1168799Z cache_alignment : 64 2025-05-07T19:43:01.1168918Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1169003Z power management: 2025-05-07T19:43:01.1169007Z 2025-05-07T19:43:01.1169079Z processor : 56 2025-05-07T19:43:01.1169161Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1169243Z cpu family : 6 2025-05-07T19:43:01.1169311Z model : 85 2025-05-07T19:43:01.1169454Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1169525Z stepping : 7 2025-05-07T19:43:01.1169611Z microcode : 0x5003901 2025-05-07T19:43:01.1169683Z cpu MHz : 3318.756 2025-05-07T19:43:01.1169756Z cache size : 36608 KB 2025-05-07T19:43:01.1169836Z physical id : 0 2025-05-07T19:43:01.1169954Z siblings : 48 2025-05-07T19:43:01.1170023Z core id : 8 2025-05-07T19:43:01.1170099Z cpu cores : 24 2025-05-07T19:43:01.1170178Z apicid : 17 2025-05-07T19:43:01.1170254Z initial apicid : 17 2025-05-07T19:43:01.1170323Z fpu : yes 2025-05-07T19:43:01.1170455Z fpu_exception : yes 2025-05-07T19:43:01.1170538Z cpuid level : 13 2025-05-07T19:43:01.1170609Z wp : yes 2025-05-07T19:43:01.1172568Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1172929Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1173003Z bogomips : 5999.98 2025-05-07T19:43:01.1173081Z clflush size : 64 2025-05-07T19:43:01.1173168Z cache_alignment : 64 2025-05-07T19:43:01.1173288Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1173364Z power management: 2025-05-07T19:43:01.1173368Z 2025-05-07T19:43:01.1173450Z processor : 57 2025-05-07T19:43:01.1173533Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1173607Z cpu family : 6 2025-05-07T19:43:01.1173677Z model : 85 2025-05-07T19:43:01.1173831Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1173905Z stepping : 7 2025-05-07T19:43:01.1173982Z microcode : 0x5003901 2025-05-07T19:43:01.1174061Z cpu MHz : 3337.669 2025-05-07T19:43:01.1174138Z cache size : 36608 KB 2025-05-07T19:43:01.1174214Z physical id : 0 2025-05-07T19:43:01.1174288Z siblings : 48 2025-05-07T19:43:01.1174371Z core id : 9 2025-05-07T19:43:01.1174448Z cpu cores : 24 2025-05-07T19:43:01.1174522Z apicid : 19 2025-05-07T19:43:01.1174610Z initial apicid : 19 2025-05-07T19:43:01.1174680Z fpu : yes 2025-05-07T19:43:01.1174760Z fpu_exception : yes 2025-05-07T19:43:01.1174839Z cpuid level : 13 2025-05-07T19:43:01.1174921Z wp : yes 2025-05-07T19:43:01.1176879Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1177243Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1177324Z bogomips : 5999.98 2025-05-07T19:43:01.1177399Z clflush size : 64 2025-05-07T19:43:01.1177478Z cache_alignment : 64 2025-05-07T19:43:01.1177610Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1177690Z power management: 2025-05-07T19:43:01.1177694Z 2025-05-07T19:43:01.1177768Z processor : 58 2025-05-07T19:43:01.1177857Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1177933Z cpu family : 6 2025-05-07T19:43:01.1178006Z model : 85 2025-05-07T19:43:01.1178149Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1178230Z stepping : 7 2025-05-07T19:43:01.1178308Z microcode : 0x5003901 2025-05-07T19:43:01.1178381Z cpu MHz : 3245.273 2025-05-07T19:43:01.1178467Z cache size : 36608 KB 2025-05-07T19:43:01.1178543Z physical id : 0 2025-05-07T19:43:01.1178616Z siblings : 48 2025-05-07T19:43:01.1178688Z core id : 10 2025-05-07T19:43:01.1178770Z cpu cores : 24 2025-05-07T19:43:01.1178888Z apicid : 21 2025-05-07T19:43:01.1178967Z initial apicid : 21 2025-05-07T19:43:01.1179046Z fpu : yes 2025-05-07T19:43:01.1179126Z fpu_exception : yes 2025-05-07T19:43:01.1179201Z cpuid level : 13 2025-05-07T19:43:01.1179315Z wp : yes 2025-05-07T19:43:01.1181285Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1181640Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1181727Z bogomips : 5999.98 2025-05-07T19:43:01.1181803Z clflush size : 64 2025-05-07T19:43:01.1181879Z cache_alignment : 64 2025-05-07T19:43:01.1181998Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1182087Z power management: 2025-05-07T19:43:01.1182091Z 2025-05-07T19:43:01.1182163Z processor : 59 2025-05-07T19:43:01.1182247Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1182326Z cpu family : 6 2025-05-07T19:43:01.1182400Z model : 85 2025-05-07T19:43:01.1182542Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1182613Z stepping : 7 2025-05-07T19:43:01.1182695Z microcode : 0x5003901 2025-05-07T19:43:01.1182772Z cpu MHz : 3269.437 2025-05-07T19:43:01.1182844Z cache size : 36608 KB 2025-05-07T19:43:01.1182928Z physical id : 0 2025-05-07T19:43:01.1183006Z siblings : 48 2025-05-07T19:43:01.1183079Z core id : 11 2025-05-07T19:43:01.1183153Z cpu cores : 24 2025-05-07T19:43:01.1183230Z apicid : 23 2025-05-07T19:43:01.1183307Z initial apicid : 23 2025-05-07T19:43:01.1183379Z fpu : yes 2025-05-07T19:43:01.1183469Z fpu_exception : yes 2025-05-07T19:43:01.1183826Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:01.1183900Z cpuid level : 13 2025-05-07T19:43:01.1183972Z wp : yes 2025-05-07T19:43:01.1185937Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.1186286Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1186369Z bogomips : 5999.98 2025-05-07T19:43:01.1186442Z clflush size : 64 2025-05-07T19:43:01.1186518Z cache_alignment : 64 2025-05-07T19:43:01.1186640Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1186725Z power management: 2025-05-07T19:43:01.1186729Z 2025-05-07T19:43:01.1186800Z processor : 60 2025-05-07T19:43:01.1186881Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1186960Z cpu family : 6 2025-05-07T19:43:01.1187029Z model : 85 2025-05-07T19:43:01.1187171Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1187241Z stepping : 7 2025-05-07T19:43:01.1187323Z microcode : 0x5003901 2025-05-07T19:43:01.1187395Z cpu MHz : 3245.920 2025-05-07T19:43:01.1187468Z cache size : 36608 KB 2025-05-07T19:43:01.1187553Z physical id : 0 2025-05-07T19:43:01.1187623Z siblings : 48 2025-05-07T19:43:01.1187694Z core id : 12 2025-05-07T19:43:01.1187769Z cpu cores : 24 2025-05-07T19:43:01.1187897Z apicid : 25 2025-05-07T19:43:01.1187972Z initial apicid : 25 2025-05-07T19:43:01.1188043Z fpu : yes 2025-05-07T19:43:01.1188119Z fpu_exception : yes 2025-05-07T19:43:01.1188200Z cpuid level : 13 2025-05-07T19:43:01.1188272Z wp : yes 2025-05-07T19:43:01.1190282Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.1190645Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1190725Z bogomips : 5999.98 2025-05-07T19:43:01.1190802Z clflush size : 64 2025-05-07T19:43:01.1190886Z cache_alignment : 64 2025-05-07T19:43:01.1191004Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1191086Z power management: 2025-05-07T19:43:01.1191090Z 2025-05-07T19:43:01.1191171Z processor : 61 2025-05-07T19:43:01.1191252Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1191323Z cpu family : 6 2025-05-07T19:43:01.1191402Z model : 85 2025-05-07T19:43:01.1191545Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1191618Z stepping : 7 2025-05-07T19:43:01.1191693Z microcode : 0x5003901 2025-05-07T19:43:01.1191770Z cpu MHz : 3213.956 2025-05-07T19:43:01.1191842Z cache size : 36608 KB 2025-05-07T19:43:01.1191913Z physical id : 0 2025-05-07T19:43:01.1191983Z siblings : 48 2025-05-07T19:43:01.1192058Z core id : 13 2025-05-07T19:43:01.1192128Z cpu cores : 24 2025-05-07T19:43:01.1192196Z apicid : 27 2025-05-07T19:43:01.1192282Z initial apicid : 27 2025-05-07T19:43:01.1192355Z fpu : yes 2025-05-07T19:43:01.1192430Z fpu_exception : yes 2025-05-07T19:43:01.1192502Z cpuid level : 13 2025-05-07T19:43:01.1192579Z wp : yes 2025-05-07T19:43:01.1194541Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.1194902Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1194976Z bogomips : 5999.98 2025-05-07T19:43:01.1195051Z clflush size : 64 2025-05-07T19:43:01.1195127Z cache_alignment : 64 2025-05-07T19:43:01.1195252Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1195325Z power management: 2025-05-07T19:43:01.1195329Z 2025-05-07T19:43:01.1195403Z processor : 62 2025-05-07T19:43:01.1195490Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1195559Z cpu family : 6 2025-05-07T19:43:01.1195627Z model : 85 2025-05-07T19:43:01.1195769Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1195921Z stepping : 7 2025-05-07T19:43:01.1196000Z microcode : 0x5003901 2025-05-07T19:43:01.1196071Z cpu MHz : 3225.582 2025-05-07T19:43:01.1196153Z cache size : 36608 KB 2025-05-07T19:43:01.1196226Z physical id : 0 2025-05-07T19:43:01.1196465Z siblings : 48 2025-05-07T19:43:01.1196541Z core id : 14 2025-05-07T19:43:01.1196626Z cpu cores : 24 2025-05-07T19:43:01.1196705Z apicid : 29 2025-05-07T19:43:01.1196783Z initial apicid : 29 2025-05-07T19:43:01.1196868Z fpu : yes 2025-05-07T19:43:01.1197007Z fpu_exception : yes 2025-05-07T19:43:01.1197087Z cpuid level : 13 2025-05-07T19:43:01.1197214Z wp : yes 2025-05-07T19:43:01.1199403Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.1199785Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1199874Z bogomips : 5999.98 2025-05-07T19:43:01.1199953Z clflush size : 64 2025-05-07T19:43:01.1200036Z cache_alignment : 64 2025-05-07T19:43:01.1200164Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1200257Z power management: 2025-05-07T19:43:01.1200262Z 2025-05-07T19:43:01.1200339Z processor : 63 2025-05-07T19:43:01.1200430Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1200515Z cpu family : 6 2025-05-07T19:43:01.1200588Z model : 85 2025-05-07T19:43:01.1200744Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1200823Z stepping : 7 2025-05-07T19:43:01.1200915Z microcode : 0x5003901 2025-05-07T19:43:01.1200992Z cpu MHz : 3272.834 2025-05-07T19:43:01.1201073Z cache size : 36608 KB 2025-05-07T19:43:01.1201164Z physical id : 0 2025-05-07T19:43:01.1201239Z siblings : 48 2025-05-07T19:43:01.1201317Z core id : 15 2025-05-07T19:43:01.1201394Z cpu cores : 24 2025-05-07T19:43:01.1201476Z apicid : 31 2025-05-07T19:43:01.1201558Z initial apicid : 31 2025-05-07T19:43:01.1201637Z fpu : yes 2025-05-07T19:43:01.1201727Z fpu_exception : yes 2025-05-07T19:43:01.1201809Z cpuid level : 13 2025-05-07T19:43:01.1201883Z wp : yes 2025-05-07T19:43:01.1204017Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:01.1204396Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1204477Z bogomips : 5999.98 2025-05-07T19:43:01.1204561Z clflush size : 64 2025-05-07T19:43:01.1204642Z cache_alignment : 64 2025-05-07T19:43:01.1204772Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1204855Z power management: 2025-05-07T19:43:01.1204859Z 2025-05-07T19:43:01.1204945Z processor : 64 2025-05-07T19:43:01.1205030Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1205108Z cpu family : 6 2025-05-07T19:43:01.1205194Z model : 85 2025-05-07T19:43:01.1205349Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1205427Z stepping : 7 2025-05-07T19:43:01.1205509Z microcode : 0x5003901 2025-05-07T19:43:01.1205596Z cpu MHz : 3209.507 2025-05-07T19:43:01.1205675Z cache size : 36608 KB 2025-05-07T19:43:01.1205754Z physical id : 0 2025-05-07T19:43:01.1205840Z siblings : 48 2025-05-07T19:43:01.1205917Z core id : 16 2025-05-07T19:43:01.1205995Z cpu cores : 24 2025-05-07T19:43:01.1206069Z apicid : 33 2025-05-07T19:43:01.1206159Z initial apicid : 33 2025-05-07T19:43:01.1206232Z fpu : yes 2025-05-07T19:43:01.1206315Z fpu_exception : yes 2025-05-07T19:43:01.1206402Z cpuid level : 13 2025-05-07T19:43:01.1206538Z wp : yes 2025-05-07T19:43:01.1208805Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:43:01.1209168Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1209242Z bogomips : 5999.98 2025-05-07T19:43:01.1209315Z clflush size : 64 2025-05-07T19:43:01.1209401Z cache_alignment : 64 2025-05-07T19:43:01.1209518Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1209595Z power management: 2025-05-07T19:43:01.1209599Z 2025-05-07T19:43:01.1209672Z processor : 65 2025-05-07T19:43:01.1209767Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1209840Z cpu family : 6 2025-05-07T19:43:01.1209910Z model : 85 2025-05-07T19:43:01.1210062Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1210138Z stepping : 7 2025-05-07T19:43:01.1210216Z microcode : 0x5003901 2025-05-07T19:43:01.1210287Z cpu MHz : 3248.100 2025-05-07T19:43:01.1210371Z cache size : 36608 KB 2025-05-07T19:43:01.1210443Z physical id : 0 2025-05-07T19:43:01.1210514Z siblings : 48 2025-05-07T19:43:01.1210596Z core id : 17 2025-05-07T19:43:01.1210670Z cpu cores : 24 2025-05-07T19:43:01.1210741Z apicid : 35 2025-05-07T19:43:01.1210817Z initial apicid : 35 2025-05-07T19:43:01.1210894Z fpu : yes 2025-05-07T19:43:01.1210970Z fpu_exception : yes 2025-05-07T19:43:01.1211044Z cpuid level : 13 2025-05-07T19:43:01.1211112Z wp : yes 2025-05-07T19:43:01.1213085Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:01.1213438Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1213520Z bogomips : 5999.98 2025-05-07T19:43:01.1213593Z clflush size : 64 2025-05-07T19:43:01.1213667Z cache_alignment : 64 2025-05-07T19:43:01.1213794Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1213869Z power management: 2025-05-07T19:43:01.1213876Z 2025-05-07T19:43:01.1213947Z processor : 66 2025-05-07T19:43:01.1214026Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1214104Z cpu family : 6 2025-05-07T19:43:01.1214176Z model : 85 2025-05-07T19:43:01.1214321Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1214402Z stepping : 7 2025-05-07T19:43:01.1214478Z microcode : 0x5003901 2025-05-07T19:43:01.1214549Z cpu MHz : 3273.000 2025-05-07T19:43:01.1214623Z cache size : 36608 KB 2025-05-07T19:43:01.1214704Z physical id : 0 2025-05-07T19:43:01.1214776Z siblings : 48 2025-05-07T19:43:01.1214847Z core id : 18 2025-05-07T19:43:01.1214926Z cpu cores : 24 2025-05-07T19:43:01.1214996Z apicid : 37 2025-05-07T19:43:01.1215071Z initial apicid : 37 2025-05-07T19:43:01.1215139Z fpu : yes 2025-05-07T19:43:01.1215224Z fpu_exception : yes 2025-05-07T19:43:01.1215297Z cpuid level : 13 2025-05-07T19:43:01.1215365Z wp : yes 2025-05-07T19:43:01.1217380Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:01.1217774Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1217849Z bogomips : 5999.98 2025-05-07T19:43:01.1217929Z clflush size : 64 2025-05-07T19:43:01.1218004Z cache_alignment : 64 2025-05-07T19:43:01.1218119Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1218202Z power management: 2025-05-07T19:43:01.1218206Z 2025-05-07T19:43:01.1218281Z processor : 67 2025-05-07T19:43:01.1218361Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1218430Z cpu family : 6 2025-05-07T19:43:01.1218511Z model : 85 2025-05-07T19:43:01.1218661Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1218735Z stepping : 7 2025-05-07T19:43:01.1218823Z microcode : 0x5003901 2025-05-07T19:43:01.1218895Z cpu MHz : 2999.994 2025-05-07T19:43:01.1218966Z cache size : 36608 KB 2025-05-07T19:43:01.1219041Z physical id : 0 2025-05-07T19:43:01.1219120Z siblings : 48 2025-05-07T19:43:01.1219190Z core id : 19 2025-05-07T19:43:01.1219263Z cpu cores : 24 2025-05-07T19:43:01.1219336Z apicid : 39 2025-05-07T19:43:01.1219424Z initial apicid : 39 2025-05-07T19:43:01.1219494Z fpu : yes 2025-05-07T19:43:01.1219570Z fpu_exception : yes 2025-05-07T19:43:01.1219646Z cpuid level : 13 2025-05-07T19:43:01.1219713Z wp : yes 2025-05-07T19:43:01.1221675Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1222033Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1222108Z bogomips : 5999.98 2025-05-07T19:43:01.1222181Z clflush size : 64 2025-05-07T19:43:01.1222260Z cache_alignment : 64 2025-05-07T19:43:01.1222377Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1222452Z power management: 2025-05-07T19:43:01.1222456Z 2025-05-07T19:43:01.1222535Z processor : 68 2025-05-07T19:43:01.1222617Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1222688Z cpu family : 6 2025-05-07T19:43:01.1222761Z model : 85 2025-05-07T19:43:01.1222910Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1222979Z stepping : 7 2025-05-07T19:43:01.1223058Z microcode : 0x5003901 2025-05-07T19:43:01.1223134Z cpu MHz : 2999.994 2025-05-07T19:43:01.1223206Z cache size : 36608 KB 2025-05-07T19:43:01.1223278Z physical id : 0 2025-05-07T19:43:01.1223347Z siblings : 48 2025-05-07T19:43:01.1223421Z core id : 20 2025-05-07T19:43:01.1223492Z cpu cores : 24 2025-05-07T19:43:01.1223560Z apicid : 41 2025-05-07T19:43:01.1223635Z initial apicid : 41 2025-05-07T19:43:01.1223707Z fpu : yes 2025-05-07T19:43:01.1223783Z fpu_exception : yes 2025-05-07T19:43:01.1223852Z cpuid level : 13 2025-05-07T19:43:01.1223924Z wp : yes 2025-05-07T19:43:01.1225932Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1226331Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1226409Z bogomips : 5999.98 2025-05-07T19:43:01.1226481Z clflush size : 64 2025-05-07T19:43:01.1226557Z cache_alignment : 64 2025-05-07T19:43:01.1226680Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1226754Z power management: 2025-05-07T19:43:01.1226758Z 2025-05-07T19:43:01.1226830Z processor : 69 2025-05-07T19:43:01.1226912Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1226991Z cpu family : 6 2025-05-07T19:43:01.1227060Z model : 85 2025-05-07T19:43:01.1227202Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1227277Z stepping : 7 2025-05-07T19:43:01.1227351Z microcode : 0x5003901 2025-05-07T19:43:01.1227424Z cpu MHz : 3217.692 2025-05-07T19:43:01.1227500Z cache size : 36608 KB 2025-05-07T19:43:01.1227578Z physical id : 0 2025-05-07T19:43:01.1227645Z siblings : 48 2025-05-07T19:43:01.1227712Z core id : 21 2025-05-07T19:43:01.1227787Z cpu cores : 24 2025-05-07T19:43:01.1227855Z apicid : 43 2025-05-07T19:43:01.1227928Z initial apicid : 43 2025-05-07T19:43:01.1227996Z fpu : yes 2025-05-07T19:43:01.1228074Z fpu_exception : yes 2025-05-07T19:43:01.1228145Z cpuid level : 13 2025-05-07T19:43:01.1228212Z wp : yes 2025-05-07T19:43:01.1230176Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1230528Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1230600Z bogomips : 5999.98 2025-05-07T19:43:01.1230686Z clflush size : 64 2025-05-07T19:43:01.1230767Z cache_alignment : 64 2025-05-07T19:43:01.1230890Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1230984Z power management: 2025-05-07T19:43:01.1230988Z 2025-05-07T19:43:01.1231068Z processor : 70 2025-05-07T19:43:01.1231156Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1231236Z cpu family : 6 2025-05-07T19:43:01.1231329Z model : 85 2025-05-07T19:43:01.1231479Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1231558Z stepping : 7 2025-05-07T19:43:01.1231656Z microcode : 0x5003901 2025-05-07T19:43:01.1231733Z cpu MHz : 3261.895 2025-05-07T19:43:01.1231815Z cache size : 36608 KB 2025-05-07T19:43:01.1231895Z physical id : 0 2025-05-07T19:43:01.1231983Z siblings : 48 2025-05-07T19:43:01.1232058Z core id : 22 2025-05-07T19:43:01.1232135Z cpu cores : 24 2025-05-07T19:43:01.1232225Z apicid : 45 2025-05-07T19:43:01.1232307Z initial apicid : 45 2025-05-07T19:43:01.1232382Z fpu : yes 2025-05-07T19:43:01.1232466Z fpu_exception : yes 2025-05-07T19:43:01.1232556Z cpuid level : 13 2025-05-07T19:43:01.1232632Z wp : yes 2025-05-07T19:43:01.1234649Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1235422Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1235504Z bogomips : 5999.98 2025-05-07T19:43:01.1235586Z clflush size : 64 2025-05-07T19:43:01.1235684Z cache_alignment : 64 2025-05-07T19:43:01.1235810Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1235969Z power management: 2025-05-07T19:43:01.1235976Z 2025-05-07T19:43:01.1236071Z processor : 71 2025-05-07T19:43:01.1236159Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1236238Z cpu family : 6 2025-05-07T19:43:01.1236485Z model : 85 2025-05-07T19:43:01.1236666Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1236756Z stepping : 7 2025-05-07T19:43:01.1236846Z microcode : 0x5003901 2025-05-07T19:43:01.1236947Z cpu MHz : 3232.148 2025-05-07T19:43:01.1237035Z cache size : 36608 KB 2025-05-07T19:43:01.1237125Z physical id : 0 2025-05-07T19:43:01.1237280Z siblings : 48 2025-05-07T19:43:01.1237378Z core id : 23 2025-05-07T19:43:01.1237461Z cpu cores : 24 2025-05-07T19:43:01.1237544Z apicid : 47 2025-05-07T19:43:01.1237642Z initial apicid : 47 2025-05-07T19:43:01.1237723Z fpu : yes 2025-05-07T19:43:01.1237810Z fpu_exception : yes 2025-05-07T19:43:01.1237894Z cpuid level : 13 2025-05-07T19:43:01.1237986Z wp : yes 2025-05-07T19:43:01.1240120Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1240517Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1240602Z bogomips : 5999.98 2025-05-07T19:43:01.1240687Z clflush size : 64 2025-05-07T19:43:01.1240775Z cache_alignment : 64 2025-05-07T19:43:01.1240918Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1241005Z power management: 2025-05-07T19:43:01.1241010Z 2025-05-07T19:43:01.1241094Z processor : 72 2025-05-07T19:43:01.1241200Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1241283Z cpu family : 6 2025-05-07T19:43:01.1241364Z model : 85 2025-05-07T19:43:01.1241526Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1241625Z stepping : 7 2025-05-07T19:43:01.1241712Z microcode : 0x5003901 2025-05-07T19:43:01.1241796Z cpu MHz : 1199.794 2025-05-07T19:43:01.1241895Z cache size : 36608 KB 2025-05-07T19:43:01.1241979Z physical id : 1 2025-05-07T19:43:01.1242065Z siblings : 48 2025-05-07T19:43:01.1242144Z core id : 0 2025-05-07T19:43:01.1242239Z cpu cores : 24 2025-05-07T19:43:01.1242321Z apicid : 65 2025-05-07T19:43:01.1242410Z initial apicid : 65 2025-05-07T19:43:01.1242503Z fpu : yes 2025-05-07T19:43:01.1242590Z fpu_exception : yes 2025-05-07T19:43:01.1242674Z cpuid level : 13 2025-05-07T19:43:01.1242754Z wp : yes 2025-05-07T19:43:01.1244892Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1245402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1245501Z bogomips : 5999.98 2025-05-07T19:43:01.1245586Z clflush size : 64 2025-05-07T19:43:01.1245674Z cache_alignment : 64 2025-05-07T19:43:01.1245805Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1245905Z power management: 2025-05-07T19:43:01.1245909Z 2025-05-07T19:43:01.1245993Z processor : 73 2025-05-07T19:43:01.1246084Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1246178Z cpu family : 6 2025-05-07T19:43:01.1246258Z model : 85 2025-05-07T19:43:01.1246590Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1246675Z stepping : 7 2025-05-07T19:43:01.1246780Z microcode : 0x5003901 2025-05-07T19:43:01.1246863Z cpu MHz : 2999.994 2025-05-07T19:43:01.1246947Z cache size : 36608 KB 2025-05-07T19:43:01.1247046Z physical id : 1 2025-05-07T19:43:01.1247126Z siblings : 48 2025-05-07T19:43:01.1247211Z core id : 1 2025-05-07T19:43:01.1247293Z cpu cores : 24 2025-05-07T19:43:01.1247388Z apicid : 67 2025-05-07T19:43:01.1247476Z initial apicid : 67 2025-05-07T19:43:01.1247557Z fpu : yes 2025-05-07T19:43:01.1247645Z fpu_exception : yes 2025-05-07T19:43:01.1247748Z cpuid level : 13 2025-05-07T19:43:01.1247830Z wp : yes 2025-05-07T19:43:01.1249964Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1250371Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1250460Z bogomips : 5999.98 2025-05-07T19:43:01.1250545Z clflush size : 64 2025-05-07T19:43:01.1250649Z cache_alignment : 64 2025-05-07T19:43:01.1250782Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1250874Z power management: 2025-05-07T19:43:01.1250879Z 2025-05-07T19:43:01.1250977Z processor : 74 2025-05-07T19:43:01.1251069Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1251156Z cpu family : 6 2025-05-07T19:43:01.1251249Z model : 85 2025-05-07T19:43:01.1251411Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1251494Z stepping : 7 2025-05-07T19:43:01.1251582Z microcode : 0x5003901 2025-05-07T19:43:01.1251681Z cpu MHz : 2456.276 2025-05-07T19:43:01.1251766Z cache size : 36608 KB 2025-05-07T19:43:01.1251851Z physical id : 1 2025-05-07T19:43:01.1251931Z siblings : 48 2025-05-07T19:43:01.1252025Z core id : 2 2025-05-07T19:43:01.1252110Z cpu cores : 24 2025-05-07T19:43:01.1252191Z apicid : 69 2025-05-07T19:43:01.1252291Z initial apicid : 69 2025-05-07T19:43:01.1252369Z fpu : yes 2025-05-07T19:43:01.1252456Z fpu_exception : yes 2025-05-07T19:43:01.1252538Z cpuid level : 13 2025-05-07T19:43:01.1252623Z wp : yes 2025-05-07T19:43:01.1254750Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1255292Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1255374Z bogomips : 5999.98 2025-05-07T19:43:01.1255454Z clflush size : 64 2025-05-07T19:43:01.1255536Z cache_alignment : 64 2025-05-07T19:43:01.1255670Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1255752Z power management: 2025-05-07T19:43:01.1255756Z 2025-05-07T19:43:01.1255834Z processor : 75 2025-05-07T19:43:01.1255925Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1256002Z cpu family : 6 2025-05-07T19:43:01.1256078Z model : 85 2025-05-07T19:43:01.1256241Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1256320Z stepping : 7 2025-05-07T19:43:01.1256402Z microcode : 0x5003901 2025-05-07T19:43:01.1256480Z cpu MHz : 1199.524 2025-05-07T19:43:01.1256566Z cache size : 36608 KB 2025-05-07T19:43:01.1256646Z physical id : 1 2025-05-07T19:43:01.1256723Z siblings : 48 2025-05-07T19:43:01.1256798Z core id : 3 2025-05-07T19:43:01.1256881Z cpu cores : 24 2025-05-07T19:43:01.1256953Z apicid : 71 2025-05-07T19:43:01.1257037Z initial apicid : 71 2025-05-07T19:43:01.1257113Z fpu : yes 2025-05-07T19:43:01.1257195Z fpu_exception : yes 2025-05-07T19:43:01.1257273Z cpuid level : 13 2025-05-07T19:43:01.1257348Z wp : yes 2025-05-07T19:43:01.1259605Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1259958Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1260042Z bogomips : 5999.98 2025-05-07T19:43:01.1260115Z clflush size : 64 2025-05-07T19:43:01.1260191Z cache_alignment : 64 2025-05-07T19:43:01.1260307Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1260389Z power management: 2025-05-07T19:43:01.1260393Z 2025-05-07T19:43:01.1260466Z processor : 76 2025-05-07T19:43:01.1260544Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1260622Z cpu family : 6 2025-05-07T19:43:01.1260693Z model : 85 2025-05-07T19:43:01.1260836Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1260906Z stepping : 7 2025-05-07T19:43:01.1260992Z microcode : 0x5003901 2025-05-07T19:43:01.1261063Z cpu MHz : 2999.994 2025-05-07T19:43:01.1261136Z cache size : 36608 KB 2025-05-07T19:43:01.1261217Z physical id : 1 2025-05-07T19:43:01.1261290Z siblings : 48 2025-05-07T19:43:01.1261360Z core id : 4 2025-05-07T19:43:01.1261431Z cpu cores : 24 2025-05-07T19:43:01.1261506Z apicid : 73 2025-05-07T19:43:01.1261579Z initial apicid : 73 2025-05-07T19:43:01.1261650Z fpu : yes 2025-05-07T19:43:01.1261728Z fpu_exception : yes 2025-05-07T19:43:01.1261801Z cpuid level : 13 2025-05-07T19:43:01.1261868Z wp : yes 2025-05-07T19:43:01.1263977Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1264370Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1264443Z bogomips : 5999.98 2025-05-07T19:43:01.1264565Z clflush size : 64 2025-05-07T19:43:01.1264640Z cache_alignment : 64 2025-05-07T19:43:01.1264757Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1264830Z power management: 2025-05-07T19:43:01.1264834Z 2025-05-07T19:43:01.1264912Z processor : 77 2025-05-07T19:43:01.1264990Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1265061Z cpu family : 6 2025-05-07T19:43:01.1265134Z model : 85 2025-05-07T19:43:01.1265275Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1265344Z stepping : 7 2025-05-07T19:43:01.1265416Z microcode : 0x5003901 2025-05-07T19:43:01.1265492Z cpu MHz : 2999.994 2025-05-07T19:43:01.1265565Z cache size : 36608 KB 2025-05-07T19:43:01.1265635Z physical id : 1 2025-05-07T19:43:01.1265715Z siblings : 48 2025-05-07T19:43:01.1265783Z core id : 5 2025-05-07T19:43:01.1265854Z cpu cores : 24 2025-05-07T19:43:01.1265921Z apicid : 75 2025-05-07T19:43:01.1266001Z initial apicid : 75 2025-05-07T19:43:01.1266069Z fpu : yes 2025-05-07T19:43:01.1266147Z fpu_exception : yes 2025-05-07T19:43:01.1266224Z cpuid level : 13 2025-05-07T19:43:01.1266293Z wp : yes 2025-05-07T19:43:01.1268249Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1268610Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1268685Z bogomips : 5999.98 2025-05-07T19:43:01.1268757Z clflush size : 64 2025-05-07T19:43:01.1268837Z cache_alignment : 64 2025-05-07T19:43:01.1268952Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1269026Z power management: 2025-05-07T19:43:01.1269031Z 2025-05-07T19:43:01.1269103Z processor : 78 2025-05-07T19:43:01.1269188Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1269258Z cpu family : 6 2025-05-07T19:43:01.1269329Z model : 85 2025-05-07T19:43:01.1269477Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1269548Z stepping : 7 2025-05-07T19:43:01.1269624Z microcode : 0x5003901 2025-05-07T19:43:01.1269696Z cpu MHz : 2999.994 2025-05-07T19:43:01.1269775Z cache size : 36608 KB 2025-05-07T19:43:01.1269848Z physical id : 1 2025-05-07T19:43:01.1269920Z siblings : 48 2025-05-07T19:43:01.1269995Z core id : 6 2025-05-07T19:43:01.1270069Z cpu cores : 24 2025-05-07T19:43:01.1270137Z apicid : 77 2025-05-07T19:43:01.1270210Z initial apicid : 77 2025-05-07T19:43:01.1270286Z fpu : yes 2025-05-07T19:43:01.1270359Z fpu_exception : yes 2025-05-07T19:43:01.1270434Z cpuid level : 13 2025-05-07T19:43:01.1270503Z wp : yes 2025-05-07T19:43:01.1272467Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1272815Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1272943Z bogomips : 5999.98 2025-05-07T19:43:01.1273016Z clflush size : 64 2025-05-07T19:43:01.1273093Z cache_alignment : 64 2025-05-07T19:43:01.1273273Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1273350Z power management: 2025-05-07T19:43:01.1273354Z 2025-05-07T19:43:01.1273426Z processor : 79 2025-05-07T19:43:01.1273506Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1273580Z cpu family : 6 2025-05-07T19:43:01.1273649Z model : 85 2025-05-07T19:43:01.1273790Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1273868Z stepping : 7 2025-05-07T19:43:01.1273942Z microcode : 0x5003901 2025-05-07T19:43:01.1274015Z cpu MHz : 1200.548 2025-05-07T19:43:01.1274090Z cache size : 36608 KB 2025-05-07T19:43:01.1274172Z physical id : 1 2025-05-07T19:43:01.1274243Z siblings : 48 2025-05-07T19:43:01.1274313Z core id : 7 2025-05-07T19:43:01.1274391Z cpu cores : 24 2025-05-07T19:43:01.1274465Z apicid : 79 2025-05-07T19:43:01.1274542Z initial apicid : 79 2025-05-07T19:43:01.1274610Z fpu : yes 2025-05-07T19:43:01.1274694Z fpu_exception : yes 2025-05-07T19:43:01.1274764Z cpuid level : 13 2025-05-07T19:43:01.1274834Z wp : yes 2025-05-07T19:43:01.1277078Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1277457Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1277541Z bogomips : 5999.98 2025-05-07T19:43:01.1277628Z clflush size : 64 2025-05-07T19:43:01.1277709Z cache_alignment : 64 2025-05-07T19:43:01.1277833Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1277924Z power management: 2025-05-07T19:43:01.1277929Z 2025-05-07T19:43:01.1278006Z processor : 80 2025-05-07T19:43:01.1278090Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1278166Z cpu family : 6 2025-05-07T19:43:01.1278250Z model : 85 2025-05-07T19:43:01.1278403Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1278481Z stepping : 7 2025-05-07T19:43:01.1278569Z microcode : 0x5003901 2025-05-07T19:43:01.1278645Z cpu MHz : 2999.994 2025-05-07T19:43:01.1278724Z cache size : 36608 KB 2025-05-07T19:43:01.1278801Z physical id : 1 2025-05-07T19:43:01.1278884Z siblings : 48 2025-05-07T19:43:01.1278961Z core id : 8 2025-05-07T19:43:01.1279036Z cpu cores : 24 2025-05-07T19:43:01.1279110Z apicid : 81 2025-05-07T19:43:01.1279197Z initial apicid : 81 2025-05-07T19:43:01.1279272Z fpu : yes 2025-05-07T19:43:01.1279353Z fpu_exception : yes 2025-05-07T19:43:01.1279434Z cpuid level : 13 2025-05-07T19:43:01.1279507Z wp : yes 2025-05-07T19:43:01.1281630Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1282013Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1282149Z bogomips : 5999.98 2025-05-07T19:43:01.1282227Z clflush size : 64 2025-05-07T19:43:01.1282313Z cache_alignment : 64 2025-05-07T19:43:01.1282438Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1282516Z power management: 2025-05-07T19:43:01.1282567Z 2025-05-07T19:43:01.1282650Z processor : 81 2025-05-07T19:43:01.1282734Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1282808Z cpu family : 6 2025-05-07T19:43:01.1282882Z model : 85 2025-05-07T19:43:01.1283038Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1283116Z stepping : 7 2025-05-07T19:43:01.1283197Z microcode : 0x5003901 2025-05-07T19:43:01.1283273Z cpu MHz : 1201.216 2025-05-07T19:43:01.1283361Z cache size : 36608 KB 2025-05-07T19:43:01.1283438Z physical id : 1 2025-05-07T19:43:01.1283513Z siblings : 48 2025-05-07T19:43:01.1283593Z core id : 9 2025-05-07T19:43:01.1283668Z cpu cores : 24 2025-05-07T19:43:01.1283743Z apicid : 83 2025-05-07T19:43:01.1283821Z initial apicid : 83 2025-05-07T19:43:01.1283904Z fpu : yes 2025-05-07T19:43:01.1283985Z fpu_exception : yes 2025-05-07T19:43:01.1284063Z cpuid level : 13 2025-05-07T19:43:01.1284140Z wp : yes 2025-05-07T19:43:01.1286262Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1286639Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1286726Z bogomips : 5999.98 2025-05-07T19:43:01.1286807Z clflush size : 64 2025-05-07T19:43:01.1286887Z cache_alignment : 64 2025-05-07T19:43:01.1287023Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1287106Z power management: 2025-05-07T19:43:01.1287110Z 2025-05-07T19:43:01.1287192Z processor : 82 2025-05-07T19:43:01.1287279Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1287368Z cpu family : 6 2025-05-07T19:43:01.1287443Z model : 85 2025-05-07T19:43:01.1287598Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1287688Z stepping : 7 2025-05-07T19:43:01.1287770Z microcode : 0x5003901 2025-05-07T19:43:01.1287847Z cpu MHz : 1200.262 2025-05-07T19:43:01.1287926Z cache size : 36608 KB 2025-05-07T19:43:01.1288013Z physical id : 1 2025-05-07T19:43:01.1288087Z siblings : 48 2025-05-07T19:43:01.1288165Z core id : 10 2025-05-07T19:43:01.1288252Z cpu cores : 24 2025-05-07T19:43:01.1288332Z apicid : 85 2025-05-07T19:43:01.1288415Z initial apicid : 85 2025-05-07T19:43:01.1288492Z fpu : yes 2025-05-07T19:43:01.1288693Z fpu_exception : yes 2025-05-07T19:43:01.1288769Z cpuid level : 13 2025-05-07T19:43:01.1288839Z wp : yes 2025-05-07T19:43:01.1290807Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1291156Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1291229Z bogomips : 5999.98 2025-05-07T19:43:01.1291315Z clflush size : 64 2025-05-07T19:43:01.1291438Z cache_alignment : 64 2025-05-07T19:43:01.1291553Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1291637Z power management: 2025-05-07T19:43:01.1291641Z 2025-05-07T19:43:01.1291715Z processor : 83 2025-05-07T19:43:01.1291840Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1291912Z cpu family : 6 2025-05-07T19:43:01.1291990Z model : 85 2025-05-07T19:43:01.1292135Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1292206Z stepping : 7 2025-05-07T19:43:01.1292296Z microcode : 0x5003901 2025-05-07T19:43:01.1292368Z cpu MHz : 1199.931 2025-05-07T19:43:01.1292439Z cache size : 36608 KB 2025-05-07T19:43:01.1292511Z physical id : 1 2025-05-07T19:43:01.1292590Z siblings : 48 2025-05-07T19:43:01.1292663Z core id : 11 2025-05-07T19:43:01.1292731Z cpu cores : 24 2025-05-07T19:43:01.1292808Z apicid : 87 2025-05-07T19:43:01.1292889Z initial apicid : 87 2025-05-07T19:43:01.1292963Z fpu : yes 2025-05-07T19:43:01.1293037Z fpu_exception : yes 2025-05-07T19:43:01.1293129Z cpuid level : 13 2025-05-07T19:43:01.1293203Z wp : yes 2025-05-07T19:43:01.1295168Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1295528Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1295602Z bogomips : 5999.98 2025-05-07T19:43:01.1295677Z clflush size : 64 2025-05-07T19:43:01.1295761Z cache_alignment : 64 2025-05-07T19:43:01.1295881Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1295962Z power management: 2025-05-07T19:43:01.1295966Z 2025-05-07T19:43:01.1296049Z processor : 84 2025-05-07T19:43:01.1296128Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1296201Z cpu family : 6 2025-05-07T19:43:01.1296270Z model : 85 2025-05-07T19:43:01.1296423Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1296493Z stepping : 7 2025-05-07T19:43:01.1296570Z microcode : 0x5003901 2025-05-07T19:43:01.1296651Z cpu MHz : 1199.472 2025-05-07T19:43:01.1296724Z cache size : 36608 KB 2025-05-07T19:43:01.1296799Z physical id : 1 2025-05-07T19:43:01.1296875Z siblings : 48 2025-05-07T19:43:01.1296960Z core id : 12 2025-05-07T19:43:01.1297037Z cpu cores : 24 2025-05-07T19:43:01.1297108Z apicid : 89 2025-05-07T19:43:01.1297195Z initial apicid : 89 2025-05-07T19:43:01.1297263Z fpu : yes 2025-05-07T19:43:01.1297339Z fpu_exception : yes 2025-05-07T19:43:01.1297413Z cpuid level : 13 2025-05-07T19:43:01.1297491Z wp : yes 2025-05-07T19:43:01.1299457Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1299812Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1299882Z bogomips : 5999.98 2025-05-07T19:43:01.1299954Z clflush size : 64 2025-05-07T19:43:01.1300027Z cache_alignment : 64 2025-05-07T19:43:01.1300150Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1300269Z power management: 2025-05-07T19:43:01.1300273Z 2025-05-07T19:43:01.1300352Z processor : 85 2025-05-07T19:43:01.1300438Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1300511Z cpu family : 6 2025-05-07T19:43:01.1300579Z model : 85 2025-05-07T19:43:01.1300805Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1300882Z stepping : 7 2025-05-07T19:43:01.1300956Z microcode : 0x5003901 2025-05-07T19:43:01.1301026Z cpu MHz : 2999.994 2025-05-07T19:43:01.1301106Z cache size : 36608 KB 2025-05-07T19:43:01.1301177Z physical id : 1 2025-05-07T19:43:01.1301245Z siblings : 48 2025-05-07T19:43:01.1301313Z core id : 13 2025-05-07T19:43:01.1301390Z cpu cores : 24 2025-05-07T19:43:01.1301459Z apicid : 91 2025-05-07T19:43:01.1301535Z initial apicid : 91 2025-05-07T19:43:01.1301609Z fpu : yes 2025-05-07T19:43:01.1301685Z fpu_exception : yes 2025-05-07T19:43:01.1301754Z cpuid level : 13 2025-05-07T19:43:01.1301823Z wp : yes 2025-05-07T19:43:01.1303791Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1304145Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1304220Z bogomips : 5999.98 2025-05-07T19:43:01.1304294Z clflush size : 64 2025-05-07T19:43:01.1304368Z cache_alignment : 64 2025-05-07T19:43:01.1304483Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1304560Z power management: 2025-05-07T19:43:01.1304567Z 2025-05-07T19:43:01.1304640Z processor : 86 2025-05-07T19:43:01.1304720Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1304801Z cpu family : 6 2025-05-07T19:43:01.1304870Z model : 85 2025-05-07T19:43:01.1305014Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1305087Z stepping : 7 2025-05-07T19:43:01.1305167Z microcode : 0x5003901 2025-05-07T19:43:01.1305240Z cpu MHz : 2999.994 2025-05-07T19:43:01.1305312Z cache size : 36608 KB 2025-05-07T19:43:01.1305390Z physical id : 1 2025-05-07T19:43:01.1305460Z siblings : 48 2025-05-07T19:43:01.1305530Z core id : 14 2025-05-07T19:43:01.1305600Z cpu cores : 24 2025-05-07T19:43:01.1305677Z apicid : 93 2025-05-07T19:43:01.1305753Z initial apicid : 93 2025-05-07T19:43:01.1305823Z fpu : yes 2025-05-07T19:43:01.1305900Z fpu_exception : yes 2025-05-07T19:43:01.1305979Z cpuid level : 13 2025-05-07T19:43:01.1306048Z wp : yes 2025-05-07T19:43:01.1308010Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1308373Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1308444Z bogomips : 5999.98 2025-05-07T19:43:01.1308516Z clflush size : 64 2025-05-07T19:43:01.1308596Z cache_alignment : 64 2025-05-07T19:43:01.1308711Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1308789Z power management: 2025-05-07T19:43:01.1308793Z 2025-05-07T19:43:01.1308874Z processor : 87 2025-05-07T19:43:01.1309000Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1309073Z cpu family : 6 2025-05-07T19:43:01.1309149Z model : 85 2025-05-07T19:43:01.1309291Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1309418Z stepping : 7 2025-05-07T19:43:01.1309496Z microcode : 0x5003901 2025-05-07T19:43:01.1309579Z cpu MHz : 1199.554 2025-05-07T19:43:01.1309653Z cache size : 36608 KB 2025-05-07T19:43:01.1309731Z physical id : 1 2025-05-07T19:43:01.1309804Z siblings : 48 2025-05-07T19:43:01.1309884Z core id : 15 2025-05-07T19:43:01.1309960Z cpu cores : 24 2025-05-07T19:43:01.1310033Z apicid : 95 2025-05-07T19:43:01.1310117Z initial apicid : 95 2025-05-07T19:43:01.1310192Z fpu : yes 2025-05-07T19:43:01.1310272Z fpu_exception : yes 2025-05-07T19:43:01.1310347Z cpuid level : 13 2025-05-07T19:43:01.1310425Z wp : yes 2025-05-07T19:43:01.1312394Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1312752Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1312830Z bogomips : 5999.98 2025-05-07T19:43:01.1312903Z clflush size : 64 2025-05-07T19:43:01.1312983Z cache_alignment : 64 2025-05-07T19:43:01.1313112Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1313187Z power management: 2025-05-07T19:43:01.1313191Z 2025-05-07T19:43:01.1313269Z processor : 88 2025-05-07T19:43:01.1313359Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1313435Z cpu family : 6 2025-05-07T19:43:01.1313506Z model : 85 2025-05-07T19:43:01.1313650Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1313733Z stepping : 7 2025-05-07T19:43:01.1313812Z microcode : 0x5003901 2025-05-07T19:43:01.1313884Z cpu MHz : 1200.288 2025-05-07T19:43:01.1313967Z cache size : 36608 KB 2025-05-07T19:43:01.1314040Z physical id : 1 2025-05-07T19:43:01.1314115Z siblings : 48 2025-05-07T19:43:01.1314187Z core id : 16 2025-05-07T19:43:01.1314267Z cpu cores : 24 2025-05-07T19:43:01.1314337Z apicid : 97 2025-05-07T19:43:01.1314419Z initial apicid : 97 2025-05-07T19:43:01.1314499Z fpu : yes 2025-05-07T19:43:01.1314576Z fpu_exception : yes 2025-05-07T19:43:01.1314650Z cpuid level : 13 2025-05-07T19:43:01.1314719Z wp : yes 2025-05-07T19:43:01.1316984Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1317375Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1317467Z bogomips : 5999.98 2025-05-07T19:43:01.1317547Z clflush size : 64 2025-05-07T19:43:01.1317639Z cache_alignment : 64 2025-05-07T19:43:01.1317770Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1317862Z power management: 2025-05-07T19:43:01.1317867Z 2025-05-07T19:43:01.1317945Z processor : 89 2025-05-07T19:43:01.1318042Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1318132Z cpu family : 6 2025-05-07T19:43:01.1318264Z model : 85 2025-05-07T19:43:01.1318422Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1318500Z stepping : 7 2025-05-07T19:43:01.1318591Z microcode : 0x5003901 2025-05-07T19:43:01.1318715Z cpu MHz : 1199.393 2025-05-07T19:43:01.1318799Z cache size : 36608 KB 2025-05-07T19:43:01.1318886Z physical id : 1 2025-05-07T19:43:01.1318964Z siblings : 48 2025-05-07T19:43:01.1319043Z core id : 17 2025-05-07T19:43:01.1319125Z cpu cores : 24 2025-05-07T19:43:01.1319214Z apicid : 99 2025-05-07T19:43:01.1319294Z initial apicid : 99 2025-05-07T19:43:01.1319370Z fpu : yes 2025-05-07T19:43:01.1319457Z fpu_exception : yes 2025-05-07T19:43:01.1319538Z cpuid level : 13 2025-05-07T19:43:01.1319615Z wp : yes 2025-05-07T19:43:01.1321747Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1322131Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1322217Z bogomips : 5999.98 2025-05-07T19:43:01.1322308Z clflush size : 64 2025-05-07T19:43:01.1322390Z cache_alignment : 64 2025-05-07T19:43:01.1322517Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1322604Z power management: 2025-05-07T19:43:01.1322608Z 2025-05-07T19:43:01.1322700Z processor : 90 2025-05-07T19:43:01.1322789Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1322868Z cpu family : 6 2025-05-07T19:43:01.1322956Z model : 85 2025-05-07T19:43:01.1323120Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1323198Z stepping : 7 2025-05-07T19:43:01.1323284Z microcode : 0x5003901 2025-05-07T19:43:01.1323376Z cpu MHz : 1199.268 2025-05-07T19:43:01.1323460Z cache size : 36608 KB 2025-05-07T19:43:01.1323545Z physical id : 1 2025-05-07T19:43:01.1323644Z siblings : 48 2025-05-07T19:43:01.1323723Z core id : 18 2025-05-07T19:43:01.1323802Z cpu cores : 24 2025-05-07T19:43:01.1323884Z apicid : 101 2025-05-07T19:43:01.1323980Z initial apicid : 101 2025-05-07T19:43:01.1324062Z fpu : yes 2025-05-07T19:43:01.1324151Z fpu_exception : yes 2025-05-07T19:43:01.1324244Z cpuid level : 13 2025-05-07T19:43:01.1324328Z wp : yes 2025-05-07T19:43:01.1326456Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1326851Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1326939Z bogomips : 5999.98 2025-05-07T19:43:01.1327027Z clflush size : 64 2025-05-07T19:43:01.1327117Z cache_alignment : 64 2025-05-07T19:43:01.1327249Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1327330Z power management: 2025-05-07T19:43:01.1327335Z 2025-05-07T19:43:01.1327419Z processor : 91 2025-05-07T19:43:01.1327516Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1327598Z cpu family : 6 2025-05-07T19:43:01.1327677Z model : 85 2025-05-07T19:43:01.1327849Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1327984Z stepping : 7 2025-05-07T19:43:01.1328071Z microcode : 0x5003901 2025-05-07T19:43:01.1328158Z cpu MHz : 1200.093 2025-05-07T19:43:01.1328251Z cache size : 36608 KB 2025-05-07T19:43:01.1328377Z physical id : 1 2025-05-07T19:43:01.1328457Z siblings : 48 2025-05-07T19:43:01.1328645Z core id : 19 2025-05-07T19:43:01.1328716Z cpu cores : 24 2025-05-07T19:43:01.1328790Z apicid : 103 2025-05-07T19:43:01.1328866Z initial apicid : 103 2025-05-07T19:43:01.1328945Z fpu : yes 2025-05-07T19:43:01.1329024Z fpu_exception : yes 2025-05-07T19:43:01.1329098Z cpuid level : 13 2025-05-07T19:43:01.1329166Z wp : yes 2025-05-07T19:43:01.1331146Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1331504Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1331592Z bogomips : 5999.98 2025-05-07T19:43:01.1331668Z clflush size : 64 2025-05-07T19:43:01.1331745Z cache_alignment : 64 2025-05-07T19:43:01.1331870Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1331952Z power management: 2025-05-07T19:43:01.1331956Z 2025-05-07T19:43:01.1332031Z processor : 92 2025-05-07T19:43:01.1332111Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1332196Z cpu family : 6 2025-05-07T19:43:01.1332268Z model : 85 2025-05-07T19:43:01.1332411Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1332507Z stepping : 7 2025-05-07T19:43:01.1332583Z microcode : 0x5003901 2025-05-07T19:43:01.1332659Z cpu MHz : 1199.409 2025-05-07T19:43:01.1332734Z cache size : 36608 KB 2025-05-07T19:43:01.1332823Z physical id : 1 2025-05-07T19:43:01.1332898Z siblings : 48 2025-05-07T19:43:01.1332967Z core id : 20 2025-05-07T19:43:01.1333052Z cpu cores : 24 2025-05-07T19:43:01.1333121Z apicid : 105 2025-05-07T19:43:01.1333201Z initial apicid : 105 2025-05-07T19:43:01.1333273Z fpu : yes 2025-05-07T19:43:01.1333368Z fpu_exception : yes 2025-05-07T19:43:01.1333442Z cpuid level : 13 2025-05-07T19:43:01.1333515Z wp : yes 2025-05-07T19:43:01.1335494Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1335849Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1335929Z bogomips : 5999.98 2025-05-07T19:43:01.1336013Z clflush size : 64 2025-05-07T19:43:01.1336097Z cache_alignment : 64 2025-05-07T19:43:01.1336216Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1336298Z power management: 2025-05-07T19:43:01.1336302Z 2025-05-07T19:43:01.1336376Z processor : 93 2025-05-07T19:43:01.1336454Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1336524Z cpu family : 6 2025-05-07T19:43:01.1336610Z model : 85 2025-05-07T19:43:01.1336755Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1336827Z stepping : 7 2025-05-07T19:43:01.1336958Z microcode : 0x5003901 2025-05-07T19:43:01.1337034Z cpu MHz : 1201.657 2025-05-07T19:43:01.1337109Z cache size : 36608 KB 2025-05-07T19:43:01.1337186Z physical id : 1 2025-05-07T19:43:01.1337273Z siblings : 48 2025-05-07T19:43:01.1337350Z core id : 21 2025-05-07T19:43:01.1337467Z cpu cores : 24 2025-05-07T19:43:01.1337538Z apicid : 107 2025-05-07T19:43:01.1337629Z initial apicid : 107 2025-05-07T19:43:01.1337699Z fpu : yes 2025-05-07T19:43:01.1337775Z fpu_exception : yes 2025-05-07T19:43:01.1337859Z cpuid level : 13 2025-05-07T19:43:01.1337934Z wp : yes 2025-05-07T19:43:01.1339896Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1340264Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1340339Z bogomips : 5999.98 2025-05-07T19:43:01.1340413Z clflush size : 64 2025-05-07T19:43:01.1340504Z cache_alignment : 64 2025-05-07T19:43:01.1340630Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1340709Z power management: 2025-05-07T19:43:01.1340713Z 2025-05-07T19:43:01.1340809Z processor : 94 2025-05-07T19:43:01.1340893Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1340966Z cpu family : 6 2025-05-07T19:43:01.1341040Z model : 85 2025-05-07T19:43:01.1341194Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1341267Z stepping : 7 2025-05-07T19:43:01.1341345Z microcode : 0x5003901 2025-05-07T19:43:01.1341431Z cpu MHz : 1527.262 2025-05-07T19:43:01.1341506Z cache size : 36608 KB 2025-05-07T19:43:01.1341586Z physical id : 1 2025-05-07T19:43:01.1341657Z siblings : 48 2025-05-07T19:43:01.1341738Z core id : 22 2025-05-07T19:43:01.1341809Z cpu cores : 24 2025-05-07T19:43:01.1341886Z apicid : 109 2025-05-07T19:43:01.1341967Z initial apicid : 109 2025-05-07T19:43:01.1342045Z fpu : yes 2025-05-07T19:43:01.1342120Z fpu_exception : yes 2025-05-07T19:43:01.1342195Z cpuid level : 13 2025-05-07T19:43:01.1342271Z wp : yes 2025-05-07T19:43:01.1344232Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1344593Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1344681Z bogomips : 5999.98 2025-05-07T19:43:01.1344752Z clflush size : 64 2025-05-07T19:43:01.1344828Z cache_alignment : 64 2025-05-07T19:43:01.1344963Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1345038Z power management: 2025-05-07T19:43:01.1345042Z 2025-05-07T19:43:01.1345115Z processor : 95 2025-05-07T19:43:01.1345217Z vendor_id : GenuineIntel 2025-05-07T19:43:01.1345294Z cpu family : 6 2025-05-07T19:43:01.1345365Z model : 85 2025-05-07T19:43:01.1345509Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:01.1345592Z stepping : 7 2025-05-07T19:43:01.1345678Z microcode : 0x5003901 2025-05-07T19:43:01.1345750Z cpu MHz : 1199.974 2025-05-07T19:43:01.1345891Z cache size : 36608 KB 2025-05-07T19:43:01.1345974Z physical id : 1 2025-05-07T19:43:01.1346047Z siblings : 48 2025-05-07T19:43:01.1346116Z core id : 23 2025-05-07T19:43:01.1346201Z cpu cores : 24 2025-05-07T19:43:01.1346271Z apicid : 111 2025-05-07T19:43:01.1346525Z initial apicid : 111 2025-05-07T19:43:01.1346600Z fpu : yes 2025-05-07T19:43:01.1346683Z fpu_exception : yes 2025-05-07T19:43:01.1346756Z cpuid level : 13 2025-05-07T19:43:01.1346989Z wp : yes 2025-05-07T19:43:01.1349127Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:01.1349512Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:01.1349593Z bogomips : 5999.98 2025-05-07T19:43:01.1349682Z clflush size : 64 2025-05-07T19:43:01.1349763Z cache_alignment : 64 2025-05-07T19:43:01.1349888Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:01.1349979Z power management: 2025-05-07T19:43:01.1349983Z 2025-05-07T19:43:01.1349987Z 2025-05-07T19:43:01.1350097Z ################################################################################ 2025-05-07T19:43:01.1350186Z [INFO] Print PCI info ... 2025-05-07T19:43:01.1350277Z + lspci -v 2025-05-07T19:43:01.1350281Z 2025-05-07T19:43:01.1350461Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:01.1350568Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:01.1350681Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:01.1350697Z 2025-05-07T19:43:01.1350896Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:01.1350979Z Physical Slot: 1 2025-05-07T19:43:01.1351092Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:01.1351096Z 2025-05-07T19:43:01.1351358Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:01.1351439Z Physical Slot: 1 2025-05-07T19:43:01.1351565Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:01.1351569Z 2025-05-07T19:43:01.1351846Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:01.1351926Z Physical Slot: 3 2025-05-07T19:43:01.1352033Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:01.1352171Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:01.1352294Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:01.1352298Z 2025-05-07T19:43:01.1352611Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:01.1352722Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:01.1352801Z Physical Slot: 4 2025-05-07T19:43:01.1352931Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:01.1353082Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:01.1353183Z Capabilities: 2025-05-07T19:43:01.1353271Z Kernel driver in use: nvme 2025-05-07T19:43:01.1353275Z 2025-05-07T19:43:01.1353488Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:01.1353574Z Physical Slot: 5 2025-05-07T19:43:01.1353678Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:01.1353825Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:01.1353955Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:01.1354108Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:01.1354813Z Capabilities: 2025-05-07T19:43:01.1354898Z Kernel driver in use: ena 2025-05-07T19:43:01.1354903Z 2025-05-07T19:43:01.1354906Z 2025-05-07T19:43:01.1355091Z ################################################################################ 2025-05-07T19:43:01.1355201Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:01.1355277Z + uname -a 2025-05-07T19:43:01.1355282Z 2025-05-07T19:43:01.1355679Z Linux 12a11cea79f2 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:01.1355684Z 2025-05-07T19:43:01.1355759Z + uname -m 2025-05-07T19:43:01.1355764Z 2025-05-07T19:43:01.1355898Z x86_64 2025-05-07T19:43:01.1355905Z 2025-05-07T19:43:01.1356000Z + cat /proc/version 2025-05-07T19:43:01.1356005Z 2025-05-07T19:43:01.1356589Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:01.1356598Z 2025-05-07T19:43:01.1356683Z + cat /etc/os-release 2025-05-07T19:43:01.1356688Z 2025-05-07T19:43:01.1356793Z NAME="Amazon Linux" 2025-05-07T19:43:01.1356872Z VERSION="2023" 2025-05-07T19:43:01.1356948Z ID="amzn" 2025-05-07T19:43:01.1357029Z ID_LIKE="fedora" 2025-05-07T19:43:01.1357109Z VERSION_ID="2023" 2025-05-07T19:43:01.1357206Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:01.1357313Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:01.1357399Z ANSI_COLOR="0;33" 2025-05-07T19:43:01.1357517Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:01.1357696Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:01.1357869Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:01.1358023Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:01.1358211Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:01.1358289Z VENDOR_NAME="AWS" 2025-05-07T19:43:01.1358405Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:01.1358493Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:01.1358497Z 2025-05-07T19:43:01.1389167Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:01.1389326Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:01.1389579Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:01.1389655Z env: 2025-05-07T19:43:01.1389768Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.1389866Z BUILD_ENV: build_binary 2025-05-07T19:43:01.1389949Z BUILD_TARGET: default 2025-05-07T19:43:01.1390030Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.1390136Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.1390212Z ##[endgroup] 2025-05-07T19:43:01.6226798Z ################################################################################ 2025-05-07T19:43:01.6227378Z [INFO] Printing general display info ... 2025-05-07T19:43:01.6239852Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:01.7103669Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:01.7115957Z /usr/bin/sudo 2025-05-07T19:43:01.7126985Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.7135931Z /usr/bin/yum 2025-05-07T19:43:01.7136799Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:01.7157160Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:01.9362597Z Last metadata expiration check: 0:00:17 ago on Wed May 7 19:42:44 2025. 2025-05-07T19:43:02.0328456Z Dependencies resolved. 2025-05-07T19:43:02.0543862Z Nothing to do. 2025-05-07T19:43:02.0544278Z Complete! 2025-05-07T19:43:02.1000467Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:02.1029701Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:02.3265461Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:44 2025. 2025-05-07T19:43:02.3782316Z Dependencies resolved. 
2025-05-07T19:43:02.3948572Z ================================================================================ 2025-05-07T19:43:02.3950584Z Package Arch Version Repository Size 2025-05-07T19:43:02.3951788Z ================================================================================ 2025-05-07T19:43:02.3952728Z Installing: 2025-05-07T19:43:02.3953638Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:02.3954970Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:02.3955245Z 2025-05-07T19:43:02.3955375Z Transaction Summary 2025-05-07T19:43:02.3955631Z ================================================================================ 2025-05-07T19:43:02.3956078Z Install 2 Packages 2025-05-07T19:43:02.3956220Z 2025-05-07T19:43:02.3956509Z Total download size: 347 k 2025-05-07T19:43:02.3956826Z Installed size: 883 k 2025-05-07T19:43:02.3957189Z Downloading Packages: 2025-05-07T19:43:02.6826333Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.7 MB/s | 28 kB 00:00 2025-05-07T19:43:02.6954968Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:43:02.6960608Z -------------------------------------------------------------------------------- 2025-05-07T19:43:02.6963410Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:02.7183122Z Running transaction check 2025-05-07T19:43:02.7234591Z Transaction check succeeded. 2025-05-07T19:43:02.7235533Z Running transaction test 2025-05-07T19:43:02.7388177Z Transaction test succeeded. 
2025-05-07T19:43:02.7388531Z Running transaction 2025-05-07T19:43:02.7655217Z Preparing : 1/1 2025-05-07T19:43:02.7726561Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:02.7757096Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:03.8113494Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:03.8114506Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:03.8477984Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:03.8478538Z 2025-05-07T19:43:03.8478678Z Installed: 2025-05-07T19:43:03.8479261Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:03.8479817Z 2025-05-07T19:43:03.8479968Z Complete! 2025-05-07T19:43:03.8838907Z + hostname 2025-05-07T19:43:03.8839090Z 2025-05-07T19:43:03.8850433Z 12a11cea79f2 2025-05-07T19:43:03.8851449Z 2025-05-07T19:43:03.8851741Z + sudo lshw -C display 2025-05-07T19:43:03.8852202Z 2025-05-07T19:43:04.0846831Z *-display UNCLAIMED 2025-05-07T19:43:04.0847695Z description: VGA compatible controller 2025-05-07T19:43:04.0848667Z product: Amazon.com, Inc. 2025-05-07T19:43:04.0849501Z vendor: Amazon.com, Inc. 2025-05-07T19:43:04.0850312Z physical id: 3 2025-05-07T19:43:04.0851013Z bus info: pci@0000:00:03.0 2025-05-07T19:43:04.0851753Z version: 00 2025-05-07T19:43:04.0852404Z width: 32 bits 2025-05-07T19:43:04.0853036Z clock: 33MHz 2025-05-07T19:43:04.0853871Z capabilities: vga_controller bus_master 2025-05-07T19:43:04.0854173Z configuration: latency=0 2025-05-07T19:43:04.0854513Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:04.0872307Z 2025-05-07T19:43:04.0872758Z ################################################################################ 2025-05-07T19:43:04.1013924Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:04.1014356Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:04.1047238Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:04.1047778Z [CHECK] nvidia-smi not found 2025-05-07T19:43:04.1048140Z ################################################################################ 2025-05-07T19:43:04.1048827Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:04.1165878Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:04.1197853Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:04.1198359Z [CHECK] rocminfo not found 2025-05-07T19:43:04.1207832Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:04.1208345Z [CHECK] rocm-smi not found 2025-05-07T19:43:04.1291048Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:04.1291521Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:04.1292089Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.1292420Z env: 2025-05-07T19:43:04.1292681Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.1292984Z BUILD_ENV: build_binary 2025-05-07T19:43:04.1293265Z BUILD_TARGET: default 2025-05-07T19:43:04.1293502Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.1293786Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.1294041Z ##[endgroup] 2025-05-07T19:43:04.5814425Z ################################################################################ 2025-05-07T19:43:04.5815475Z # Setup Miniconda 2025-05-07T19:43:04.5816103Z # 2025-05-07T19:43:04.5829801Z # [2025-05-07T19:43:04.582Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:04.5831062Z ################################################################################ 2025-05-07T19:43:04.5831899Z 2025-05-07T19:43:04.5841930Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:04.6736271Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:43:04.6737339Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:04.6737946Z 2025-05-07T19:43:04.6747276Z 2025-05-07T19:43:04.6747930Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:04.6783121Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:05.6159435Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:05.6159952Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:05.6160224Z 2025-05-07T19:43:05.6302617Z PREFIX=/github/home/miniconda 2025-05-07T19:43:05.9825108Z Unpacking payload ... 2025-05-07T19:43:06.4685146Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:07.1419101Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:09.0033057Z 2025-05-07T19:43:09.0033883Z Installing base environment... 2025-05-07T19:43:09.0034568Z 2025-05-07T19:43:09.9877134Z Preparing transaction: ...working... done 2025-05-07T19:43:12.8372373Z Executing transaction: ...working... done 2025-05-07T19:43:13.3827105Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:13.4498819Z installation finished. 2025-05-07T19:43:13.4505372Z 2025-05-07T19:43:13.4505557Z + rm -f miniconda.sh 2025-05-07T19:43:13.4505810Z 2025-05-07T19:43:13.4676683Z 2025-05-07T19:43:13.4677239Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:13.4677786Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:13.4678029Z 2025-05-07T19:43:13.8379619Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:13.8380075Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:13.8380492Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:13.8380879Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:13.8381293Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:13.8381750Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:13.8382494Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:13.8383004Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:13.8383489Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:13.8384087Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:13.8384807Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:13.8385236Z modified /github/home/.bashrc 2025-05-07T19:43:13.8385443Z 2025-05-07T19:43:13.8385699Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:13.8386027Z 2025-05-07T19:43:13.8921087Z 2025-05-07T19:43:13.8921654Z + . /github/home/.bashrc 2025-05-07T19:43:13.8921931Z 2025-05-07T19:43:14.6806835Z 2025-05-07T19:43:14.6807793Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:14.6833832Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:26.4625885Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:27.9084431Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:27.9973990Z 2025-05-07T19:43:27.9974457Z ## Package Plan ## 2025-05-07T19:43:27.9974992Z 2025-05-07T19:43:27.9975478Z environment location: /github/home/miniconda 2025-05-07T19:43:27.9976198Z 2025-05-07T19:43:27.9976583Z added / updated specs: 2025-05-07T19:43:27.9977339Z - conda-libmamba-solver 2025-05-07T19:43:27.9978160Z - libarchive 2025-05-07T19:43:27.9978771Z - libmamba 2025-05-07T19:43:27.9979397Z - libmambapy 2025-05-07T19:43:27.9979775Z 2025-05-07T19:43:27.9979787Z 2025-05-07T19:43:27.9980177Z The following packages will be downloaded: 2025-05-07T19:43:27.9980845Z 2025-05-07T19:43:27.9980976Z package | build 2025-05-07T19:43:27.9981379Z ---------------------------|----------------- 2025-05-07T19:43:27.9981844Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:27.9982385Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:27.9982849Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:27.9983393Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:27.9983906Z ------------------------------------------------------------ 2025-05-07T19:43:27.9984286Z Total: 1.4 MB 2025-05-07T19:43:27.9984519Z 2025-05-07T19:43:27.9984672Z The following packages will be UPDATED: 2025-05-07T19:43:27.9984897Z 2025-05-07T19:43:27.9989410Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:27.9990252Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:27.9990663Z 2025-05-07T19:43:27.9990925Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:27.9991250Z 2025-05-07T19:43:27.9991570Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:27.9992391Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:27.9992875Z 2025-05-07T19:43:27.9992879Z 2025-05-07T19:43:27.9993199Z 2025-05-07T19:43:27.9993383Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:27.9993769Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:27.9994032Z 2025-05-07T19:43:27.9994360Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:27.9994629Z 2025-05-07T19:43:27.9994632Z 2025-05-07T19:43:27.9997870Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:27.9998155Z 2025-05-07T19:43:27.9998159Z 2025-05-07T19:43:27.9998383Z 2025-05-07T19:43:28.0479746Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:28.0480111Z 2025-05-07T19:43:28.0480116Z 2025-05-07T19:43:28.0485684Z 2025-05-07T19:43:28.0620459Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.0621399Z 2025-05-07T19:43:28.0622103Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:28.0622843Z 2025-05-07T19:43:28.0622855Z 2025-05-07T19:43:28.0622866Z 2025-05-07T19:43:28.0636715Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:28.0752069Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.0752392Z 2025-05-07T19:43:28.0785966Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:28.0786274Z 2025-05-07T19:43:28.0786279Z 2025-05-07T19:43:28.0877652Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.0878004Z 2025-05-07T19:43:28.0878009Z 2025-05-07T19:43:28.1689421Z 
ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:28.1690708Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.1694036Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:28.1694456Z 2025-05-07T19:43:28.1694678Z 2025-05-07T19:43:28.1694911Z  2025-05-07T19:43:28.1695166Z 2025-05-07T19:43:28.1695171Z 2025-05-07T19:43:28.1695375Z  2025-05-07T19:43:28.1695606Z 2025-05-07T19:43:28.1695611Z 2025-05-07T19:43:28.1695643Z 2025-05-07T19:43:28.1698072Z  done 2025-05-07T19:43:28.2706592Z Preparing transaction: | done 2025-05-07T19:43:28.3719007Z Verifying transaction: - done 2025-05-07T19:43:29.6750352Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:31.2298587Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:31.2320720Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:31.9589550Z Channels: 2025-05-07T19:43:31.9590280Z - defaults 2025-05-07T19:43:31.9590925Z Platform: linux-64 2025-05-07T19:43:33.0714680Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:33.2058385Z Solving environment: / - Channels: 2025-05-07T19:43:33.2058769Z - defaults 2025-05-07T19:43:33.2059092Z Platform: linux-64 2025-05-07T19:43:33.4800798Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:33.6988411Z Solving environment: / - \ | done 2025-05-07T19:43:33.7886910Z done 2025-05-07T19:43:33.8545132Z 2025-05-07T19:43:33.8545736Z ## Package Plan ## 2025-05-07T19:43:33.8546262Z 2025-05-07T19:43:33.8547012Z environment location: /github/home/miniconda 2025-05-07T19:43:33.8547754Z 2025-05-07T19:43:33.8548037Z added / updated specs: 2025-05-07T19:43:33.8548827Z - conda 2025-05-07T19:43:33.8549171Z 2025-05-07T19:43:33.8549182Z 2025-05-07T19:43:33.8549539Z The following packages will be downloaded: 2025-05-07T19:43:33.8550232Z 2025-05-07T19:43:33.8550392Z package | build 2025-05-07T19:43:33.8550742Z 
---------------------------|----------------- 2025-05-07T19:43:33.8551151Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:33.8551567Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:33.8552376Z ------------------------------------------------------------ 2025-05-07T19:43:33.8552777Z Total: 1.4 MB 2025-05-07T19:43:33.8553008Z 2025-05-07T19:43:33.8553136Z The following packages will be UPDATED: 2025-05-07T19:43:33.8553360Z 2025-05-07T19:43:33.8553729Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:33.8554430Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:33.8554735Z 2025-05-07T19:43:33.8554739Z 2025-05-07T19:43:33.8554743Z 2025-05-07T19:43:33.8554902Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:33.8555331Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:33.8555567Z 2025-05-07T19:43:33.8870027Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:33.8870342Z 2025-05-07T19:43:33.9245389Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0756896Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0757660Z 2025-05-07T19:43:34.0758433Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0759195Z 2025-05-07T19:43:34.0970220Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:34.0971242Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0973271Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:34.0973674Z 2025-05-07T19:43:34.0973896Z 2025-05-07T19:43:34.0977136Z  done 2025-05-07T19:43:34.1986358Z Preparing transaction: - done 2025-05-07T19:43:34.2994374Z Verifying transaction: | done 2025-05-07T19:43:36.2028283Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:36.7759150Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:36.7759760Z + conda clean --packages --tarball -y 2025-05-07T19:43:36.7759997Z 2025-05-07T19:43:37.2137125Z Will remove 99 (117.8 MB) tarball(s). 
2025-05-07T19:43:37.2137580Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:37.2678756Z 2025-05-07T19:43:37.2683083Z + conda clean --all -y 2025-05-07T19:43:37.2683359Z 2025-05-07T19:43:37.7200674Z There are no unused tarball(s) to remove. 2025-05-07T19:43:37.7201134Z Will remove 1 index cache(s). 2025-05-07T19:43:37.7201498Z There are no unused package(s) to remove. 2025-05-07T19:43:37.7201888Z There are no tempfile(s) to remove. 2025-05-07T19:43:37.7202206Z There are no logfile(s) to remove. 2025-05-07T19:43:37.7750447Z 2025-05-07T19:43:37.7750981Z + conda info 2025-05-07T19:43:38.3431115Z 2025-05-07T19:43:38.3431136Z 2025-05-07T19:43:38.3431636Z active environment : base 2025-05-07T19:43:38.3432338Z active env location : /github/home/miniconda 2025-05-07T19:43:38.3432729Z shell level : 1 2025-05-07T19:43:38.3433105Z user config file : /github/home/.condarc 2025-05-07T19:43:38.3433517Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:38.3433938Z conda version : 25.3.1 2025-05-07T19:43:38.3434275Z conda-build version : not installed 2025-05-07T19:43:38.3434604Z python version : 3.13.2.final.0 2025-05-07T19:43:38.3434957Z solver : libmamba (default) 2025-05-07T19:43:38.3435324Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:38.3435707Z __conda=25.3.1=0 2025-05-07T19:43:38.3436143Z __glibc=2.34=0 2025-05-07T19:43:38.3436487Z __linux=6.1.130=0 2025-05-07T19:43:38.3436861Z __unix=0=0 2025-05-07T19:43:38.3437245Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:38.3437716Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:38.3438091Z conda av metadata url : None 2025-05-07T19:43:38.3438837Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:38.3439291Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:38.3439737Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:38.3440136Z https://repo.anaconda.com/pkgs/r/noarch 
2025-05-07T19:43:38.3440544Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:38.3441038Z /github/home/.conda/pkgs 2025-05-07T19:43:38.3441432Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:38.3441831Z /github/home/.conda/envs 2025-05-07T19:43:38.3442156Z platform : linux-64 2025-05-07T19:43:38.3443151Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:38.3443999Z UID:GID : 0:0 2025-05-07T19:43:38.3444297Z netrc file : None 2025-05-07T19:43:38.3444566Z offline mode : False 2025-05-07T19:43:38.3444765Z 2025-05-07T19:43:38.4020249Z 2025-05-07T19:43:38.4020643Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:38.4021341Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_d99757cc-aaa6-41dc-8644-5f6cd0e8d1b4 ... 2025-05-07T19:43:38.4022071Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:38.4166470Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:38.4167020Z . 
$PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:38.4167807Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:38.4168233Z env: 2025-05-07T19:43:38.4168460Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:38.4168748Z BUILD_ENV: build_binary 2025-05-07T19:43:38.4169011Z BUILD_TARGET: default 2025-05-07T19:43:38.4169246Z BUILD_VARIANT: cuda 2025-05-07T19:43:38.4169464Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:38.4169712Z ##[endgroup] 2025-05-07T19:43:38.8838536Z ################################################################################ 2025-05-07T19:43:38.8838919Z # Create Conda Environment 2025-05-07T19:43:38.8839194Z # 2025-05-07T19:43:38.8866046Z # [2025-05-07T19:43:38.885Z] + create_conda_environment build_binary 3.10 2025-05-07T19:43:38.8866695Z ################################################################################ 2025-05-07T19:43:38.8866941Z 2025-05-07T19:43:38.8886216Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:38.9719628Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:38.9720800Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:38.9721779Z + conda info --envs 2025-05-07T19:43:38.9722188Z 2025-05-07T19:43:39.5341565Z 2025-05-07T19:43:39.5342006Z # conda environments: 2025-05-07T19:43:39.5342391Z # 2025-05-07T19:43:39.5342649Z base /github/home/miniconda 2025-05-07T19:43:39.5342887Z 2025-05-07T19:43:39.5937325Z 2025-05-07T19:43:39.5937931Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:41.2335204Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:41.2336034Z 2025-05-07T19:43:41.2356251Z 2025-05-07T19:43:41.2364236Z [SETUP] Creating new Conda environment (Python 3.10) ... 
2025-05-07T19:43:41.2390810Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.10 2025-05-07T19:43:41.8144092Z Channels: 2025-05-07T19:43:41.8144752Z - defaults 2025-05-07T19:43:41.8145350Z Platform: linux-64 2025-05-07T19:43:43.1222052Z Collecting package metadata (repodata.json): - \ | / - \ | / done 2025-05-07T19:43:43.2229075Z Solving environment: \ done 2025-05-07T19:43:43.2519054Z 2025-05-07T19:43:43.2519624Z ## Package Plan ## 2025-05-07T19:43:43.2519880Z 2025-05-07T19:43:43.2520454Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:43.2520802Z 2025-05-07T19:43:43.2520964Z added / updated specs: 2025-05-07T19:43:43.2521252Z - python=3.10 2025-05-07T19:43:43.2521404Z 2025-05-07T19:43:43.2521408Z 2025-05-07T19:43:43.2521581Z The following packages will be downloaded: 2025-05-07T19:43:43.2521818Z 2025-05-07T19:43:43.2521950Z package | build 2025-05-07T19:43:43.2522334Z ---------------------------|----------------- 2025-05-07T19:43:43.2522747Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:43.2523212Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:43.2523736Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:43.2524174Z python-3.10.16 | he870216_1 26.9 MB 2025-05-07T19:43:43.2524635Z setuptools-78.1.1 | py310h06a4308_0 1.7 MB 2025-05-07T19:43:43.2525055Z wheel-0.45.1 | py310h06a4308_0 115 KB 2025-05-07T19:43:43.2525483Z ------------------------------------------------------------ 2025-05-07T19:43:43.2525883Z Total: 28.8 MB 2025-05-07T19:43:43.2526098Z 2025-05-07T19:43:43.2526238Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:43.2526478Z 2025-05-07T19:43:43.2526722Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:43.2527170Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:43.2527808Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:43.2528342Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 
2025-05-07T19:43:43.2528911Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:43.2529443Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:43.2529906Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:43.2530417Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:43.2530935Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:43.2531417Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:43.2531864Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:43.2532310Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:43.2532772Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:43.2533175Z python pkgs/main/linux-64::python-3.10.16-he870216_1 2025-05-07T19:43:43.2533616Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:43.2534105Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py310h06a4308_0 2025-05-07T19:43:43.2534579Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:43.2534990Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:43.2535373Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:43.2535812Z wheel pkgs/main/linux-64::wheel-0.45.1-py310h06a4308_0 2025-05-07T19:43:43.2536227Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:43.2536602Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:43.2536868Z 2025-05-07T19:43:43.2536872Z 2025-05-07T19:43:43.2536875Z 2025-05-07T19:43:43.2537028Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:43.3196189Z _openmp_mutex-5.1 | 21 KB | ########## | 100%
2025-05-07T19:43:43.3342709Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%
2025-05-07T19:43:43.3357155Z ca-certificates-2025 | 129 KB | ########## | 100%
2025-05-07T19:43:43.3564517Z wheel-0.45.1 | 115 KB | ########## | 100%
2025-05-07T19:43:43.3565113Z setuptools-78.1.1 | 1.7 MB | ########## | 100%
2025-05-07T19:43:43.7150604Z python-3.10.16 | 26.9 MB | ########## | 100%
2025-05-07T19:43:44.2481563Z done
2025-05-07T19:43:44.4591395Z Preparing transaction: / - done
2025-05-07T19:43:45.5969130Z Verifying transaction: | / - \ | / - \ | / - done
2025-05-07T19:43:47.7099613Z Executing transaction: | / - \ | / - \ | / - \ | / - \ |
/ - \ | done 2025-05-07T19:43:47.7140806Z # 2025-05-07T19:43:47.7141583Z # To activate this environment, use 2025-05-07T19:43:47.7142474Z # 2025-05-07T19:43:47.7143090Z # $ conda activate build_binary 2025-05-07T19:43:47.7143907Z # 2025-05-07T19:43:47.7144523Z # To deactivate an active environment, use 2025-05-07T19:43:47.7145411Z # 2025-05-07T19:43:47.7145949Z # $ conda deactivate 2025-05-07T19:43:47.7146870Z 2025-05-07T19:43:47.8013930Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:47.8038956Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:50.6349379Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:50.6350917Z 2025-05-07T19:43:50.6351354Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (25.1) 2025-05-07T19:43:50.6352313Z Collecting pip 2025-05-07T19:43:50.6352681Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:50.6353157Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:50.6354016Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 84.1 MB/s eta 0:00:00 2025-05-07T19:43:50.6354436Z Installing collected packages: pip 2025-05-07T19:43:50.6354762Z Attempting uninstall: pip 2025-05-07T19:43:50.6355127Z Found existing installation: pip 25.1 2025-05-07T19:43:50.6355518Z Uninstalling pip-25.1: 2025-05-07T19:43:50.6355915Z Successfully uninstalled pip-25.1 2025-05-07T19:43:50.6356284Z Successfully installed pip-25.1.1 2025-05-07T19:43:50.6356490Z 2025-05-07T19:43:50.7134747Z [SETUP] Upgrading pyOpenSSL ... 
2025-05-07T19:43:50.7168238Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:51.3781416Z Channels: 2025-05-07T19:43:51.3782072Z - conda-forge 2025-05-07T19:43:51.3782779Z Platform: linux-64 2025-05-07T19:44:01.0720619Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:02.9117668Z Solving environment: / - \ | / done 2025-05-07T19:44:02.9560023Z 2025-05-07T19:44:02.9560941Z ## Package Plan ## 2025-05-07T19:44:02.9561199Z 2025-05-07T19:44:02.9561440Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:02.9561853Z 2025-05-07T19:44:02.9562007Z added / updated specs: 2025-05-07T19:44:02.9562326Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:02.9562562Z 2025-05-07T19:44:02.9562567Z 2025-05-07T19:44:02.9562708Z The following packages will be downloaded: 2025-05-07T19:44:02.9562948Z 2025-05-07T19:44:02.9563126Z package | build 2025-05-07T19:44:02.9563494Z ---------------------------|----------------- 2025-05-07T19:44:02.9563931Z cffi-1.17.1 | py310h8deb56e_0 238 KB conda-forge 2025-05-07T19:44:02.9564445Z cryptography-44.0.3 | py310h6c63255_0 1.5 MB conda-forge 2025-05-07T19:44:02.9564954Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:02.9565404Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:02.9565881Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:02.9567932Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:02.9568403Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:02.9568912Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:02.9569379Z python_abi-3.10 | 2_cp310 4 KB conda-forge 2025-05-07T19:44:02.9569906Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:02.9570443Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:02.9570934Z 
------------------------------------------------------------ 2025-05-07T19:44:02.9571343Z Total: 6.3 MB 2025-05-07T19:44:02.9571575Z 2025-05-07T19:44:02.9571715Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:02.9571982Z 2025-05-07T19:44:02.9572225Z cffi conda-forge/linux-64::cffi-1.17.1-py310h8deb56e_0 2025-05-07T19:44:02.9572769Z cryptography conda-forge/linux-64::cryptography-44.0.3-py310h6c63255_0 2025-05-07T19:44:02.9573333Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:02.9573874Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:02.9574382Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:02.9574905Z python_abi conda-forge/linux-64::python_abi-3.10-2_cp310 2025-05-07T19:44:02.9575659Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:02.9576318Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:02.9576686Z 2025-05-07T19:44:02.9576842Z The following packages will be UPDATED: 2025-05-07T19:44:02.9577063Z 2025-05-07T19:44:02.9577483Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:02.9578370Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:02.9579070Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:02.9579788Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:02.9580187Z 2025-05-07T19:44:02.9580191Z 2025-05-07T19:44:02.9580194Z 2025-05-07T19:44:02.9580388Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:03.0433842Z cffi-1.17.1 | 238 KB | ########## | 100%
2025-05-07T19:44:03.0438021Z libgomp-15.1.0 | 442 KB | ########## | 100%
2025-05-07T19:44:03.0654231Z cryptography-44.0.3 | 1.5 MB | ########## | 100%
2025-05-07T19:44:03.1269780Z typing-extensions-4. | 88 KB | ########## | 100%
2025-05-07T19:44:03.1367914Z libgcc-15.1.0 | 810 KB | ########## | 100%
2025-05-07T19:44:03.1504973Z pycparser-2.22 | 108 KB | ########## | 100%
2025-05-07T19:44:03.1652668Z openssl-3.5.0 | 3.0 MB | ########## | 100%
2025-05-07T19:44:03.1705717Z typing_extensions-4. | 51 KB | ########## | 100%
2025-05-07T19:44:03.1927798Z pyopenssl-25.0.0 | 120 KB | ########## | 100%
2025-05-07T19:44:03.1954984Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:03.1961757Z python_abi-3.10 | 4 KB | ########## | 100%
2025-05-07T19:44:03.3812758Z done
2025-05-07T19:44:03.4817843Z Preparing transaction: \ done
2025-05-07T19:44:03.5827914Z Verifying transaction: / done
2025-05-07T19:44:04.9855619Z Executing transaction: \ | / - \ | / - \
| / - \ | done 2025-05-07T19:44:05.0824018Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:06.7619264Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:06.7625817Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:06.7650726Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:07.4268981Z Channels: 2025-05-07T19:44:07.4269677Z - conda-forge 2025-05-07T19:44:07.4270358Z Platform: linux-64 2025-05-07T19:44:10.5134854Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:10.9415089Z Solving environment: \ done 2025-05-07T19:44:10.9876955Z 2025-05-07T19:44:10.9877974Z ## Package Plan ## 2025-05-07T19:44:10.9878479Z 2025-05-07T19:44:10.9879065Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:10.9880009Z 2025-05-07T19:44:10.9880306Z added / updated specs: 2025-05-07T19:44:10.9880995Z - libxcrypt 2025-05-07T19:44:10.9881374Z 2025-05-07T19:44:10.9881385Z 2025-05-07T19:44:10.9881755Z The following packages will be downloaded: 2025-05-07T19:44:10.9882401Z 2025-05-07T19:44:10.9882733Z package | build 2025-05-07T19:44:10.9883702Z ---------------------------|----------------- 2025-05-07T19:44:10.9884422Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:10.9884831Z ------------------------------------------------------------ 2025-05-07T19:44:10.9885179Z Total: 98 KB 2025-05-07T19:44:10.9885386Z 2025-05-07T19:44:10.9885513Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:10.9885749Z 2025-05-07T19:44:10.9885990Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:10.9886286Z 2025-05-07T19:44:10.9886289Z 2025-05-07T19:44:10.9886293Z 2025-05-07T19:44:10.9886450Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:11.1347149Z libxcrypt-4.4.36 | 98 KB | ########## | 100%
2025-05-07T19:44:11.1352672Z done
2025-05-07T19:44:11.2358212Z Preparing transaction: / done
2025-05-07T19:44:11.3367208Z Verifying transaction: \ done
2025-05-07T19:44:11.4378975Z Executing transaction: / done
2025-05-07T19:44:14.7162940Z [SETUP] Copying over ...
2025-05-07T19:44:14.7163803Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.10/crypt.h
2025-05-07T19:44:14.7164775Z
2025-05-07T19:44:14.7191079Z
2025-05-07T19:44:16.3266959Z [SETUP] Installed Python version: Python 3.10.16
2025-05-07T19:44:16.3268288Z [SETUP] Successfully created Conda environment: build_binary
2025-05-07T19:44:16.3341415Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc
2025-05-07T19:44:16.3341916Z .
$PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:16.3342515Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:16.3342830Z env: 2025-05-07T19:44:16.3343067Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:16.3343366Z BUILD_ENV: build_binary 2025-05-07T19:44:16.3343663Z BUILD_TARGET: default 2025-05-07T19:44:16.3343926Z BUILD_VARIANT: cuda 2025-05-07T19:44:16.3344178Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:16.3344423Z ##[endgroup] 2025-05-07T19:44:16.7629660Z ################################################################################ 2025-05-07T19:44:16.7630725Z # Install C/C++ Compilers 2025-05-07T19:44:16.7631443Z # 2025-05-07T19:44:16.7648244Z # [2025-05-07T19:44:16.764Z] + install_cxx_compiler build_binary gcc 2025-05-07T19:44:16.7649210Z ################################################################################ 2025-05-07T19:44:16.7649442Z 2025-05-07T19:44:16.7668869Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:16.8508084Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:16.8516333Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:16.8547232Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:17.5159470Z Channels: 2025-05-07T19:44:17.5159761Z - conda-forge 2025-05-07T19:44:17.5159992Z Platform: linux-64 2025-05-07T19:44:20.5658832Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:20.9860130Z Solving environment: \ done 2025-05-07T19:44:21.0318043Z 2025-05-07T19:44:21.0318373Z ## Package Plan ## 2025-05-07T19:44:21.0318639Z 2025-05-07T19:44:21.0318847Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:21.0319163Z 2025-05-07T19:44:21.0319278Z added / updated specs: 2025-05-07T19:44:21.0319561Z - sysroot_linux-64=2.17 2025-05-07T19:44:21.0319729Z 2025-05-07T19:44:21.0319734Z 2025-05-07T19:44:21.0319872Z The following packages will be downloaded: 2025-05-07T19:44:21.0320092Z 2025-05-07T19:44:21.0320208Z package | build 2025-05-07T19:44:21.0320550Z ---------------------------|----------------- 2025-05-07T19:44:21.0320978Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:21.0321499Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:21.0321937Z ------------------------------------------------------------ 2025-05-07T19:44:21.0322284Z Total: 15.4 MB 2025-05-07T19:44:21.0322513Z 2025-05-07T19:44:21.0322648Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:21.0322877Z 2025-05-07T19:44:21.0323185Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:21.0323794Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:21.0324117Z 2025-05-07T19:44:21.0324121Z 2025-05-07T19:44:21.0324125Z 2025-05-07T19:44:21.0324284Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:21.3319380Z kernel-headers_linux | 921 KB | ########## | 100%
2025-05-07T19:44:21.5838222Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100%
2025-05-07T19:44:21.9487923Z done
2025-05-07T19:44:22.0494920Z Preparing transaction: / done
2025-05-07T19:44:22.2504838Z Verifying transaction: \ | done
2025-05-07T19:44:22.3515463Z Executing transaction: - done
2025-05-07T19:44:22.4356980Z [CHECK] LD_LIBRARY_PATH =
2025-05-07T19:44:22.4357391Z [CHECK] CONDA_PREFIX is not set.
2025-05-07T19:44:24.0719502Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6
2025-05-07T19:44:24.0729408Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ...
2025-05-07T19:44:24.0753246Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:24.7619407Z Channels: 2025-05-07T19:44:24.7620065Z - conda-forge 2025-05-07T19:44:24.7620723Z Platform: linux-64 2025-05-07T19:44:27.9467375Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:29.0810065Z Solving environment: \ | / - done 2025-05-07T19:44:29.1312643Z 2025-05-07T19:44:29.1313390Z ## Package Plan ## 2025-05-07T19:44:29.1313912Z 2025-05-07T19:44:29.1314497Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:29.1315400Z 2025-05-07T19:44:29.1315675Z added / updated specs: 2025-05-07T19:44:29.1316640Z - gxx_linux-64=11.4.0 2025-05-07T19:44:29.1317103Z 2025-05-07T19:44:29.1317116Z 2025-05-07T19:44:29.1317519Z The following packages will be downloaded: 2025-05-07T19:44:29.1318166Z 2025-05-07T19:44:29.1318570Z package | build 2025-05-07T19:44:29.1319558Z ---------------------------|----------------- 2025-05-07T19:44:29.1320525Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:29.1321039Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:29.1321532Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:29.1321994Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:29.1322458Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:29.1322904Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:29.1323358Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:29.1323848Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:29.1324341Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:29.1324811Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:29.1325290Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:29.1325796Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:29.1326208Z ------------------------------------------------------------ 2025-05-07T19:44:29.1326571Z Total: 91.6 MB 2025-05-07T19:44:29.1326785Z 2025-05-07T19:44:29.1326932Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:29.1327156Z 2025-05-07T19:44:29.1327448Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:29.1328040Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:29.1328881Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:29.1329600Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:29.1330138Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:29.1330658Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:29.1331222Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:29.1331794Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:29.1332313Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:29.1332886Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:29.1333370Z 2025-05-07T19:44:29.1333481Z The following packages will be UPDATED: 2025-05-07T19:44:29.1333689Z 2025-05-07T19:44:29.1353863Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:29.1355038Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:29.1355460Z 2025-05-07T19:44:29.1355463Z 2025-05-07T19:44:29.1355467Z 2025-05-07T19:44:29.1355612Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:29.4715271Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%
2025-05-07T19:44:29.5648933Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%
2025-05-07T19:44:29.5864153Z binutils_impl_linux- | 6.0 MB | ########## | 100%
2025-05-07T19:44:29.6558636Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%
2025-05-07T19:44:29.6639442Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%
2025-05-07T19:44:29.6798505Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:29.6944156Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%
2025-05-07T19:44:29.6967977Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%
2025-05-07T19:44:29.6985752Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%
2025-05-07T19:44:29.7087047Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%
2025-05-07T19:44:29.7247844Z binutils_linux-64-2. | 28 KB | ########## | 100%
2025-05-07T19:44:32.9474193Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100%
2025-05-07T19:44:32.9487631Z done
2025-05-07T19:44:33.0491486Z Preparing transaction: | done
2025-05-07T19:44:33.2502871Z Verifying transaction: - \ done
2025-05-07T19:44:33.3518145Z Executing transaction: / done 2025-05-07T19:44:33.4438332Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:37.1196829Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:37.1197518Z 2025-05-07T19:44:37.1206979Z 2025-05-07T19:44:37.1227815Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:37.1228419Z 2025-05-07T19:44:37.1239800Z 2025-05-07T19:44:37.1258823Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:37.1260611Z 2025-05-07T19:44:37.1271460Z 2025-05-07T19:44:37.1298285Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:37.1299067Z 2025-05-07T19:44:37.1317260Z 2025-05-07T19:44:38.9392986Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:38.9393477Z 2025-05-07T19:44:38.9998230Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:40.7750284Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:40.7751098Z 2025-05-07T19:44:40.8324042Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:42.5923799Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:42.5924733Z 2025-05-07T19:44:42.6487559Z [CHECK] Binary c++ found in PATH 2025-05-07T19:44:44.4350268Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:44.4351063Z 2025-05-07T19:44:44.4949255Z [CHECK] Binary g++ found in PATH 2025-05-07T19:44:44.4950503Z [INFO] Printing out all preprocessor defines in the C compiler ... 
2025-05-07T19:44:44.4951753Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:44:44.4952414Z 2025-05-07T19:44:46.2598178Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2598627Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:46.2598984Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:46.2599280Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2599674Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:46.2600063Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:46.2600402Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:46.2600779Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:46.2601100Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:46.2601384Z #define __CHAR_BIT__ 8 2025-05-07T19:44:46.2601708Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:46.2602019Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:46.2602296Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:46.2602636Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:46.2602938Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:46.2603292Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2603626Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:46.2603971Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:46.2604325Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:46.2604700Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:46.2605144Z #define __DBL_DENORM_MIN__ ((double)4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:46.2605626Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:46.2606304Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:46.2606627Z #define __GCC_IEC_559 2 2025-05-07T19:44:46.2606903Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2607414Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:46.2607740Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2608165Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:46.2608554Z #define 
__UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2609014Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:46.2609321Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.2609612Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:46.2609924Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:46.2610200Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:46.2610498Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:46.2610774Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:46.2611075Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:46.2611368Z #define __INT8_C(c) c 2025-05-07T19:44:46.2611621Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:46.2611952Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2612274Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:46.2612624Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.2612982Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2613288Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.2613558Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2613862Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:46.2614173Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:46.2614569Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:46.2615021Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:46.2615319Z #define __linux 1 2025-05-07T19:44:46.2615583Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:46.2615874Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:46.2616196Z #define __unix 1 2025-05-07T19:44:46.2616432Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:46.2616751Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2617035Z #define __WINT_MIN__ 0U 2025-05-07T19:44:46.2617334Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2617648Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:46.2617929Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:46.2618233Z 
#define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:46.2618497Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:46.2618813Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:46.2619126Z #define __INT64_C(c) c ## L 2025-05-07T19:44:46.2619431Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:46.2619738Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:46.2620040Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:46.2620401Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:46.2620809Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:46.2621098Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:46.2621375Z #define __DBL_DIG__ 15 2025-05-07T19:44:46.2621637Z #define __FLT32_DIG__ 6 2025-05-07T19:44:46.2621943Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:46.2622329Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:46.2622582Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:46.2622940Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:46.2623294Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:46.2623577Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.2623869Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:46.2624253Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:46.2624686Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:46.2624969Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:46.2625269Z #define __unix__ 1 2025-05-07T19:44:46.2625503Z #define __INT_WIDTH__ 32 2025-05-07T19:44:46.2625784Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:46.2626038Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:46.2626428Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:46.2626705Z #define __UINT16_C(c) c 2025-05-07T19:44:46.2626983Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:46.2627357Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:46.2627716Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 
2025-05-07T19:44:46.2628109Z #define __gnu_linux__ 1 2025-05-07T19:44:46.2628355Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:46.2628659Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2628958Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2629265Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:46.2629530Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:46.2629810Z #define __GNUC__ 11 2025-05-07T19:44:46.2630035Z #define __pie__ 2 2025-05-07T19:44:46.2630279Z #define __MMX__ 1 2025-05-07T19:44:46.2630536Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:46.2630815Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:46.2631125Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:46.2631413Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:46.2631800Z #define __DBL_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.2632218Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2632561Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.2632829Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:46.2633123Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:46.2633430Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:46.2633729Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:46.2634018Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:46.2634312Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:46.2634642Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:46.2634918Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:46.2635233Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:46.2635496Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:46.2635906Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:46.2636378Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:46.2636862Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:46.2637179Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:46.2637560Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.2637957Z #define __DEC128_EPSILON__ 
1E-33DL 2025-05-07T19:44:46.2638294Z #define __SSE2_MATH__ 1 2025-05-07T19:44:46.2638567Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:46.2638932Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2639261Z #define __amd64 1 2025-05-07T19:44:46.2639542Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:46.2639841Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:46.2640202Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:46.2640577Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:46.2640873Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2641205Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:46.2641492Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:46.2641822Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:46.2642108Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:46.2642539Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:46.2642814Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:46.2643140Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:46.2643396Z #define __x86_64 1 2025-05-07T19:44:46.2643659Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:46.2644057Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:46.2644521Z #define __DBL_MIN__ ((double)2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:46.2645004Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:46.2645481Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.2645894Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:46.2646152Z #define __LP64__ 1 2025-05-07T19:44:46.2646602Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2647159Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:46.2647787Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:46.2648125Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2648430Z #define __FLT64_MIN_10_EXP__ (-307) 
2025-05-07T19:44:46.2648869Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:46.2649178Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:46.2649505Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:46.2649792Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:46.2650106Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:46.2650395Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.2650780Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:46.2651203Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:46.2651504Z #define __FLT_DIG__ 6 2025-05-07T19:44:46.2651788Z #define __NO_INLINE__ 1 2025-05-07T19:44:46.2652055Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:46.2652432Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:46.2652816Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:46.2653248Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:46.2653521Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:46.2653812Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:46.2654081Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:46.2654372Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:46.2654694Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:46.2654985Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:46.2655289Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:46.2655597Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.2655953Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:46.2656226Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:46.2656516Z #define __FLT128_DIG__ 33 2025-05-07T19:44:46.2656766Z #define __INT32_C(c) c 2025-05-07T19:44:46.2657039Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:46.2657322Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:46.2657633Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:46.2657953Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:46.2658278Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:46.2658631Z 
#define unix 1 2025-05-07T19:44:46.2658873Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:46.2659218Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2659534Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:46.2659902Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:46.2660235Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:46.2660515Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:46.2660795Z #define __ELF__ 1 2025-05-07T19:44:46.2661061Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:46.2661370Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:46.2661647Z #define __FLT_RADIX__ 2 2025-05-07T19:44:46.2661929Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:46.2662286Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:46.2662675Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:46.2662943Z #define __SSE_MATH__ 1 2025-05-07T19:44:46.2663202Z #define __k8 1 2025-05-07T19:44:46.2663512Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:46.2663927Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:46.2664263Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:46.2664568Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:46.2664868Z #define __LDBL_DIG__ 18 2025-05-07T19:44:46.2665124Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:46.2665423Z #define __x86_64__ 1 2025-05-07T19:44:46.2665668Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2666000Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:46.2666342Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2666681Z #define __FLT64_DIG__ 15 2025-05-07T19:44:46.2666965Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2667342Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.2667696Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2668074Z #define __FLT_MAX_10_EXP__ 38 
2025-05-07T19:44:46.2668396Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2668709Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:46.2671119Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:46.2671589Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:46.2671926Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:46.2672275Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:46.2672641Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:46.2672983Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:46.2673302Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:46.2673620Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:46.2673951Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:46.2674206Z #define __SEG_FS 1 2025-05-07T19:44:46.2674486Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:46.2674777Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:46.2675102Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2675396Z #define __SEG_GS 1 2025-05-07T19:44:46.2675866Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:46.2676501Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:46.2676908Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:46.2677278Z #define __INT16_TYPE__ short int 2025-05-07T19:44:46.2677594Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:46.2677956Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:46.2678258Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:46.2678573Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:46.2678862Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:46.2679280Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.2679749Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2680070Z #define linux 1 2025-05-07T19:44:46.2680360Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2680680Z #define __FLT32_MIN_10_EXP__ 
(-37) 2025-05-07T19:44:46.2681035Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:46.2681323Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:46.2681631Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:46.2681922Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:46.2682319Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.2682765Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:46.2683146Z #define __code_model_small__ 1 2025-05-07T19:44:46.2683463Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:46.2683772Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:46.2684066Z #define __k8__ 1 2025-05-07T19:44:46.2684311Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:46.2684648Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:46.2684968Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:46.2685262Z #define __pic__ 2 2025-05-07T19:44:46.2685540Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2685901Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:46.2686239Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2686627Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:46.2687065Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.2687462Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:46.2687784Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:46.2688106Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.2688471Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:46.2688857Z #define __linux__ 1 2025-05-07T19:44:46.2689115Z #define __INT64_TYPE__ long int 2025-05-07T19:44:46.2689391Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:46.2689685Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:46.2689994Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:46.2690266Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:46.2690589Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2690930Z #define 
__GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:46.2691379Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:46.2691662Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:46.2691999Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:46.2692378Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:46.2692746Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.2693110Z #define __SSE__ 1 2025-05-07T19:44:46.2693368Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2693744Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.2694099Z #define __amd64__ 1 2025-05-07T19:44:46.2694363Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:46.2694627Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:46.2694930Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:46.2695210Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:46.2695509Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:46.2695795Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:46.2696086Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:46.2696373Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2696674Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:46.2697057Z #define __DBL_EPSILON__ ((double)2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:46.2697524Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:46.2697909Z #define _LP64 1 2025-05-07T19:44:46.2698132Z #define __UINT8_C(c) c 2025-05-07T19:44:46.2698399Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:46.2698671Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:46.2698965Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:46.2699244Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:46.2699573Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:46.2699953Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.2700422Z #define __FLT64_NORM_MAX__ 
1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.2700833Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2701138Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2701478Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:46.2701853Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:46.2702264Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:46.2702533Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:46.2702905Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:46.2703314Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2703582Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:46.2703866Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:46.2704124Z #define __FXSR__ 1 2025-05-07T19:44:46.2704432Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.2704883Z #define __DBL_NORM_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.2705301Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.2705603Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:46.2705866Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:46.2706200Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:46.2706550Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:46.2706808Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:46.2707042Z #define __PIC__ 2 2025-05-07T19:44:46.2707310Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:46.2707705Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.2708109Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:46.2708438Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.2708788Z #define __SSE2__ 1 2025-05-07T19:44:46.2709030Z #define __INT32_TYPE__ int 2025-05-07T19:44:46.2709280Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:46.2709557Z #define __FLT_MIN_10_EXP__ 
(-37) 2025-05-07T19:44:46.2709889Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:46.2710267Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:46.2710618Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:46.2710906Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:46.2711171Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2711540Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:46.2711784Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:46.2712049Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:46.2712347Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2712637Z #define __PIE__ 2 2025-05-07T19:44:46.2712970Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:46.2713356Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:46.2713718Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:46.2714087Z #define __INT16_C(c) c 2025-05-07T19:44:46.2714332Z #define __STDC__ 1 2025-05-07T19:44:46.2714553Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:46.2714831Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:46.2715081Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.2715387Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:46.2715844Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:46.2716359Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:46.2716656Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2717018Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:46.2717315Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:46.2717604Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:46.2717925Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2718207Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:46.2718548Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2719006Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.2719420Z 
#define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:46.2719780Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:46.2720104Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:46.2720401Z #define __ATOMIC_RELEASE 3
2025-05-07T19:44:46.2720580Z
2025-05-07T19:44:46.3177465Z
2025-05-07T19:44:46.3178039Z [INFO] Printing out all preprocessor defines in the C++ compiler ...
2025-05-07T19:44:46.3178753Z + conda run -n build_binary c++ -dM -E -x c++ -
2025-05-07T19:44:46.3179000Z
2025-05-07T19:44:48.0799360Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:48.0800321Z #define __cpp_attributes 200809L 2025-05-07T19:44:48.0801332Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:44:48.0802399Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:48.0803256Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:48.0804018Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:48.0804996Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:48.0805394Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:48.0805705Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:44:48.0806059Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:48.0806377Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:48.0806698Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:48.0806957Z #define __CHAR_BIT__ 8 2025-05-07T19:44:48.0807216Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:48.0807489Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:48.0807775Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:48.0808062Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:48.0808368Z #define __cpp_static_assert 201411L 2025-05-07T19:44:48.0808686Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:48.0808996Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0809326Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:48.0809627Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:48.0809984Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1
2025-05-07T19:44:48.0810319Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:48.0810759Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:48.0811191Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:48.0811805Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:48.0812128Z #define __GCC_IEC_559 2 2025-05-07T19:44:48.0812387Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:48.0812798Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:48.0813217Z #define __cpp_binary_literals 201304L 2025-05-07T19:44:48.0813774Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:48.0814059Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:44:48.0814392Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:48.0814701Z #define __cpp_variadic_templates 200704L 2025-05-07T19:44:48.0815043Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0815375Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:48.0815636Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.0815925Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:48.0816195Z #define __cpp_variable_templates 201304L 2025-05-07T19:44:48.0816511Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:48.0816785Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:48.0817048Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:48.0817336Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:44:48.0817683Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:44:48.0818012Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:48.0818285Z #define __INT8_C(c) c 2025-05-07T19:44:48.0818520Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:48.0818810Z #define __cpp_variadic_using 201611L 2025-05-07T19:44:48.0819127Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0819470Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:48.0819746Z #define __cpp_capture_star_this 201603L 2025-05-07T19:44:48.0820060Z #define __SHRT_MAX__ 
0x7fff 2025-05-07T19:44:48.0820390Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.0820744Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:48.0821051Z #define __cpp_if_constexpr 201606L 2025-05-07T19:44:48.0821324Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.0821609Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0821892Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:48.0822192Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:48.0822584Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:48.0823014Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:48.0823298Z #define __linux 1 2025-05-07T19:44:48.0823550Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:48.0823857Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:48.0824133Z #define __unix 1 2025-05-07T19:44:48.0824370Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:48.0824642Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:44:48.0824941Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:48.0825205Z #define __WINT_MIN__ 0U 2025-05-07T19:44:48.0825459Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.0825731Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:48.0826011Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:48.0826273Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:48.0826530Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:48.0826815Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:48.0827105Z #define __INT64_C(c) c ## L 2025-05-07T19:44:48.0827383Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:48.0827672Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:48.0827964Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:48.0828261Z #define __cpp_aligned_new 201606L 2025-05-07T19:44:48.0828552Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:48.0828811Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:48.0829163Z #define 
__FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:48.0829546Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:48.0829788Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:48.0830074Z #define __cpp_decltype_auto 201304L 2025-05-07T19:44:48.0830338Z #define __DBL_DIG__ 15 2025-05-07T19:44:48.0830580Z #define __FLT32_DIG__ 6 2025-05-07T19:44:48.0830976Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:48.0831341Z #define __GXX_WEAK__ 1 2025-05-07T19:44:48.0831577Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:48.0831907Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:48.0832224Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:48.0832581Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.0832852Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:48.0833145Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:44:48.0833488Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:44:48.0833890Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:48.0834300Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:48.0834568Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:48.0834861Z #define __unix__ 1 2025-05-07T19:44:48.0835089Z #define __INT_WIDTH__ 32 2025-05-07T19:44:48.0835341Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:48.0835595Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:48.0835987Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:48.0836457Z #define __UINT16_C(c) c 2025-05-07T19:44:48.0836728Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:48.0837043Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:48.0837412Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:48.0837819Z #define __gnu_linux__ 1 2025-05-07T19:44:48.0838068Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:48.0838362Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:48.0838651Z #define __FLT64X_MIN_10_EXP__ 
(-4931) 2025-05-07T19:44:48.0838985Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0839286Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:48.0839557Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:48.0839857Z #define __GNUC__ 11 2025-05-07T19:44:48.0840102Z #define __GXX_RTTI 1 2025-05-07T19:44:48.0840378Z #define __pie__ 2 2025-05-07T19:44:48.0840616Z #define __MMX__ 1 2025-05-07T19:44:48.0840883Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:48.0841176Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:48.0841509Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:48.0841802Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:48.0842102Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:48.0842544Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:44:48.0842896Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:48.0843273Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.0843663Z #define __cpp_raw_strings 200710L 2025-05-07T19:44:48.0844005Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0844334Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.0844628Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:48.0844902Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:48.0845235Z #define __cpp_fold_expressions 201603L 2025-05-07T19:44:48.0845538Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:48.0845838Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:48.0846141Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:48.0846632Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:48.0847177Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:48.0847497Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:48.0847868Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:48.0848155Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:48.0848478Z #define __cplusplus 201703L 2025-05-07T19:44:48.0848776Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:44:48.0849125Z #define __DEC32_MIN__ 1E-95DF 
2025-05-07T19:44:48.0849407Z #define __DEPRECATED 1 2025-05-07T19:44:48.0849715Z #define __cpp_rvalue_references 200610L 2025-05-07T19:44:48.0850077Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:48.0850360Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:48.0850744Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.0851144Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:48.0851489Z #define __SSE2_MATH__ 1 2025-05-07T19:44:48.0851772Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:48.0852292Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0852623Z #define __amd64 1 2025-05-07T19:44:48.0852909Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:48.0853304Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:48.0853640Z #define __GNUG__ 11 2025-05-07T19:44:48.0853956Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:48.0854303Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:48.0854618Z #define __cpp_nsdmi 200809L 2025-05-07T19:44:48.0854910Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:48.0855250Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:48.0855540Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:48.0855878Z #define __cpp_initializer_lists 200806L 2025-05-07T19:44:48.0856200Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:48.0856517Z #define __cpp_hex_float 201603L 2025-05-07T19:44:48.0856810Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:48.0857119Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:48.0857453Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:48.0857748Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:48.0858069Z #define __x86_64 1 2025-05-07T19:44:48.0858315Z #define __cpp_lambdas 200907L 2025-05-07T19:44:48.0858638Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:48.0859041Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:48.0859614Z #define __cpp_template_auto 201606L 2025-05-07T19:44:48.0859996Z #define 
__DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:48.0860616Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:48.0861128Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.0861532Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:48.0861821Z #define __LP64__ 1 2025-05-07T19:44:48.0862059Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0862436Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:48.0862825Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:48.0863131Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.0863421Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:48.0863732Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:48.0864033Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:48.0864300Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:48.0864594Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.0864926Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:48.0865319Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:48.0865605Z #define __FLT_DIG__ 6 2025-05-07T19:44:48.0865879Z #define __NO_INLINE__ 1 2025-05-07T19:44:48.0866138Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:48.0866488Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:48.0866847Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:48.0867145Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:48.0867442Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:48.0867708Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:48.0868014Z #define __cpp_unicode_characters 201411L 2025-05-07T19:44:48.0868324Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:48.0868624Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:48.0868922Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:48.0869241Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:48.0869519Z #define __FLT32_MANT_DIG__ 
24 2025-05-07T19:44:48.0869854Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.0870201Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:44:48.0870522Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:48.0870821Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:48.0871085Z #define __FLT128_DIG__ 33 2025-05-07T19:44:48.0871360Z #define __INT32_C(c) c 2025-05-07T19:44:48.0871610Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:48.0871919Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:48.0872198Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:48.0872606Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:48.0872919Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:48.0873262Z #define unix 1 2025-05-07T19:44:48.0873613Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:48.0873908Z #define __cpp_rtti 199711L 2025-05-07T19:44:48.0874207Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:48.0874521Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0874859Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:48.0875175Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:48.0875543Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:48.0875893Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:48.0876406Z #define __cpp_digit_separators 201309L 2025-05-07T19:44:48.0876714Z #define __ELF__ 1 2025-05-07T19:44:48.0877049Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:48.0877389Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:48.0877694Z #define __FLT_RADIX__ 2 2025-05-07T19:44:48.0878000Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:48.0878394Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:48.0878814Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:48.0879118Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:44:48.0879445Z #define __k8 1 2025-05-07T19:44:48.0879765Z #define __FLT32X_MIN__ 
2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:48.0880197Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:48.0880520Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:48.0880884Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:48.0881197Z #define __LDBL_DIG__ 18 2025-05-07T19:44:48.0881464Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:48.0881785Z #define __x86_64__ 1 2025-05-07T19:44:48.0882051Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:48.0882412Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:48.0882780Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0883144Z #define __FLT64_DIG__ 15 2025-05-07T19:44:48.0883448Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0883860Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.0884249Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0884550Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:48.0884886Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0885219Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:48.0885658Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:48.0886103Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:48.0886458Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:48.0886821Z #define __cpp_unicode_literals 200710L 2025-05-07T19:44:48.0887199Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:48.0887558Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:48.0887920Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:48.0888258Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:48.0888717Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:48.0889051Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:48.0889312Z #define __SEG_FS 1 2025-05-07T19:44:48.0889584Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:48.0889867Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:48.0890180Z 
#define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0890477Z #define __SEG_GS 1 2025-05-07T19:44:48.0890824Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:48.0891234Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:48.0891509Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:48.0891819Z #define __INT16_TYPE__ short int 2025-05-07T19:44:48.0892103Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:48.0892439Z #define __cpp_structured_bindings 201606L 2025-05-07T19:44:48.0892741Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:48.0893020Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:48.0893288Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:48.0893670Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.0894166Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0894526Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:44:48.0894949Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:44:48.0895262Z #define linux 1 2025-05-07T19:44:48.0895526Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0895814Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.0896118Z #define __EXCEPTIONS 1 2025-05-07T19:44:48.0896372Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:48.0896670Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:48.0896953Z #define __cpp_range_based_for 201603L 2025-05-07T19:44:48.0897281Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:48.0897635Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.0898059Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:44:48.0898437Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:48.0898780Z #define __code_model_small__ 1 2025-05-07T19:44:48.0899083Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:48.0899400Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:44:48.0899741Z #define __DEC32_MANT_DIG__ 7 
2025-05-07T19:44:48.0900030Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:44:48.0900357Z #define __k8__ 1 2025-05-07T19:44:48.0900590Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:48.0900908Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:48.0901234Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:48.0901485Z #define __pic__ 2 2025-05-07T19:44:48.0901772Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0902092Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:48.0902394Z #define __cpp_decltype 200707L 2025-05-07T19:44:48.0902694Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0903057Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:48.0903430Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.0903825Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:48.0904154Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.0904490Z #define __cpp_inline_variables 201606L 2025-05-07T19:44:48.0904814Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:48.0905074Z #define __linux__ 1 2025-05-07T19:44:48.0905335Z #define __INT64_TYPE__ long int 2025-05-07T19:44:48.0905603Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:48.0905889Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:48.0906167Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:48.0906474Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:44:48.0906797Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:48.0907114Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0907462Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:48.0907730Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:48.0908053Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:48.0908356Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:48.0908721Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.0909087Z #define __SSE__ 1 2025-05-07T19:44:48.0909349Z #define 
__LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:48.0909693Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.0910080Z #define __amd64__ 1 2025-05-07T19:44:48.0910327Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:48.0910619Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:48.0910933Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:48.0911205Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:48.0911521Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:48.0911785Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:48.0912091Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:48.0912368Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:48.0912738Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:48.0913211Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:48.0913704Z #define _LP64 1 2025-05-07T19:44:48.0913963Z #define __UINT8_C(c) c 2025-05-07T19:44:48.0914216Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:48.0914586Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:48.0914872Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:48.0915178Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:48.0915540Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.0916322Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.0916736Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0917202Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.0917582Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:48.0917933Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:44:48.0918391Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:48.0918802Z #define __STDCPP_THREADS__ 1 2025-05-07T19:44:48.0919139Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:48.0919439Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:48.0919842Z #define 
__FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:48.0920254Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:48.0920586Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:48.0920865Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:48.0921174Z #define __FXSR__ 1 2025-05-07T19:44:48.0921538Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.0922043Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.0922532Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.0922871Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:48.0923206Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:44:48.0923530Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:48.0923874Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:48.0942551Z #define __cpp_alias_templates 200704L 2025-05-07T19:44:48.0943080Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:48.0943471Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:48.0943772Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:48.0944060Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:48.0944308Z #define __PIC__ 2 2025-05-07T19:44:48.0944600Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:48.0945006Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.0945423Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:48.0945783Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.0946136Z #define __cpp_constexpr 201603L 2025-05-07T19:44:48.0946606Z #define __SSE2__ 1 2025-05-07T19:44:48.0947044Z #define __cpp_deduction_guides 201703L 2025-05-07T19:44:48.0947398Z #define __INT32_TYPE__ int 2025-05-07T19:44:48.0947672Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:48.0947980Z #define __cpp_exceptions 199711L 2025-05-07T19:44:48.0948281Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.0948665Z #define 
__FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:48.0949051Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:48.0949378Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:48.0949686Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:48.0949975Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0950295Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:48.0950563Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:48.0950865Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:44:48.0951183Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:48.0951523Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0951845Z #define __PIE__ 2 2025-05-07T19:44:48.0952228Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:48.0952685Z #define __cpp_template_template_args 201611L 2025-05-07T19:44:48.0953043Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:48.0953438Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:48.0954047Z #define __INT16_C(c) c 2025-05-07T19:44:48.0954326Z #define __STDC__ 1 2025-05-07T19:44:48.0954655Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:48.0954961Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:48.0955262Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:48.0955569Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.0956011Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:48.0956420Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:48.0956839Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:48.0957139Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.0957487Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:44:48.0957799Z #define __SSE_MATH__ 1 2025-05-07T19:44:48.0958095Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:48.0958409Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:44:48.0958762Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:48.0959075Z #define 
__GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:48.0959420Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.0959710Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:48.0960053Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.0960505Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.0960909Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:48.0961258Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:48.0961574Z #define _GNU_SOURCE 1 2025-05-07T19:44:48.0961864Z #define __cpp_init_captures 201304L 2025-05-07T19:44:48.0962168Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:48.0962467Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:48.0962639Z 2025-05-07T19:44:48.1367982Z 2025-05-07T19:44:48.1368345Z + conda run -n build_binary c++ --version 2025-05-07T19:44:48.1368612Z 2025-05-07T19:44:49.9224248Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:44:49.9225027Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:44:49.9225595Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:44:49.9226197Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:44:49.9226680Z 2025-05-07T19:44:49.9226686Z 2025-05-07T19:44:49.9801784Z 2025-05-07T19:44:49.9802835Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:44:49.9804580Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:44:49.9805545Z 2025-05-07T19:44:51.8020406Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:51.8021103Z 2025-05-07T19:44:51.8021650Z [INFO] Printing the default version of the C++ standard used by the compiler ... 
2025-05-07T19:44:51.8022421Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:44:51.8022738Z 2025-05-07T19:44:53.6852879Z #define __cplusplus 201703L 2025-05-07T19:44:53.6859360Z 2025-05-07T19:44:53.6860606Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:44:53.6938991Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:53.6939502Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:53.6940327Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:53.6940713Z env: 2025-05-07T19:44:53.6940999Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:53.6941337Z BUILD_ENV: build_binary 2025-05-07T19:44:53.6941642Z BUILD_TARGET: default 2025-05-07T19:44:53.6941903Z BUILD_VARIANT: cuda 2025-05-07T19:44:53.6942194Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:53.6942470Z ##[endgroup] 2025-05-07T19:44:54.1652704Z ################################################################################ 2025-05-07T19:44:54.1653118Z # Install Build Tools 2025-05-07T19:44:54.1653380Z # 2025-05-07T19:44:54.1672128Z # [2025-05-07T19:44:54.166Z] + install_build_tools build_binary 2025-05-07T19:44:54.1672598Z ################################################################################ 2025-05-07T19:44:54.1673197Z 2025-05-07T19:44:54.1689924Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:54.2537210Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:54.2545896Z [INSTALL] Installing build tools ... 
2025-05-07T19:44:54.2571796Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:44:54.9657964Z Channels: 2025-05-07T19:44:54.9658620Z - conda-forge 2025-05-07T19:44:54.9659270Z Platform: linux-64 2025-05-07T19:44:58.0189631Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:01.3985395Z Solving environment: \ | / - done 2025-05-07T19:45:01.4506417Z 2025-05-07T19:45:01.4506990Z ## Package Plan ## 2025-05-07T19:45:01.4507476Z 2025-05-07T19:45:01.4508071Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:01.4509013Z 2025-05-07T19:45:01.4509561Z added / updated specs: 2025-05-07T19:45:01.4510280Z - auditwheel 2025-05-07T19:45:01.4510873Z - bazel 2025-05-07T19:45:01.4511472Z - cmake[version='>=3.30'] 2025-05-07T19:45:01.4512187Z - hypothesis 2025-05-07T19:45:01.4512603Z - jinja2 2025-05-07T19:45:01.4512801Z - make 2025-05-07T19:45:01.4513009Z - ncurses 2025-05-07T19:45:01.4513212Z - ninja 2025-05-07T19:45:01.4513426Z - openblas 2025-05-07T19:45:01.4513633Z - patchelf 2025-05-07T19:45:01.4513857Z - pyyaml 2025-05-07T19:45:01.4514071Z - rhash 2025-05-07T19:45:01.4514270Z - scikit-build 2025-05-07T19:45:01.4514502Z - wheel 2025-05-07T19:45:01.4514616Z 2025-05-07T19:45:01.4514620Z 2025-05-07T19:45:01.4514742Z The following packages will be downloaded: 2025-05-07T19:45:01.4514979Z 2025-05-07T19:45:01.4515098Z package | build 2025-05-07T19:45:01.4515422Z ---------------------------|----------------- 2025-05-07T19:45:01.4515975Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:01.4516622Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:01.4517162Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:01.4517610Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:01.4518024Z c-ares-1.34.5 | hb9d3cd8_0 
202 KB conda-forge 2025-05-07T19:45:01.4518449Z cairo-1.18.4 | h3394656_0 955 KB conda-forge 2025-05-07T19:45:01.4518856Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:01.4519281Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:01.4519715Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:01.4520502Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:01.4521070Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:01.4521619Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:01.4522181Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:01.4522790Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:01.4523247Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:01.4523715Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.4524172Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.4524614Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:01.4525006Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:01.4525577Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:01.4526004Z harfbuzz-11.0.0 | h76408a6_0 1.6 MB conda-forge 2025-05-07T19:45:01.4526426Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:01.4526848Z icu-75.1 | he02047a_0 11.6 MB conda-forge 2025-05-07T19:45:01.4527212Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:01.4527611Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:01.4528011Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:01.4528412Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:01.4528800Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:01.4529172Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:01.4529616Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:01.4530048Z 
libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:01.4530463Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:01.4530892Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:01.4531333Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:01.4531771Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:01.4532160Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:01.4532597Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:01.4533028Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:01.4533485Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:01.4533943Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:01.4534360Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:01.4534775Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:01.4535178Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:45:01.4535615Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:01.4536032Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:01.4536469Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:01.4536924Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:01.4537329Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:01.4537885Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:01.4538314Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:01.4538742Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:01.4539192Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:01.4539612Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:01.4540036Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:01.4540433Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:01.4540854Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 
2025-05-07T19:45:01.4541247Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:01.4541682Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:01.4542168Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:01.4542561Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:01.4542961Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:01.4543368Z markupsafe-3.0.2 | py310h89163eb_1 23 KB conda-forge 2025-05-07T19:45:01.4543800Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:01.4544186Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:01.4544628Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:01.4545073Z openjdk-23.0.2 | h53dfc1b_2 181.4 MB conda-forge 2025-05-07T19:45:01.4545479Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:01.4545914Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:01.4546302Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:01.4546906Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:01.4547538Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:01.4547989Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:01.4548449Z python-3.10.17 |hd6af730_0_cpython 23.9 MB conda-forge 2025-05-07T19:45:01.4548881Z pyyaml-6.0.2 | py310h89163eb_2 178 KB conda-forge 2025-05-07T19:45:01.4549309Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:01.4549708Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:01.4550160Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:01.4550629Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:01.4551098Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:01.4551572Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:01.4551975Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:01.4552397Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB 
conda-forge 2025-05-07T19:45:01.4552926Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:01.4553354Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:01.4553786Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:01.4554202Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:01.4554788Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:01.4555227Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.4555680Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:01.4556430Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.4556940Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:01.4557428Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:01.4557916Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:01.4558408Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:01.4558864Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:01.4559310Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:01.4559848Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:01.4560293Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:01.4560727Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:01.4561124Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:01.4561542Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:01.4561936Z ------------------------------------------------------------ 2025-05-07T19:45:01.4562309Z Total: 343.7 MB 2025-05-07T19:45:01.4562639Z 2025-05-07T19:45:01.4562779Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:01.4562993Z 2025-05-07T19:45:01.4563203Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:01.4563640Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:01.4564079Z auditwheel 
conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:01.4564523Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:01.4564940Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:01.4565338Z cairo conda-forge/linux-64::cairo-1.18.4-h3394656_0 2025-05-07T19:45:01.4565754Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:01.4566146Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:01.4566565Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:01.4567055Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:01.4567626Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:01.4568239Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:01.4568823Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:01.4569391Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:01.4572357Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:01.4572848Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:01.4573341Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:01.4573789Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.4574226Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:01.4574674Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:01.4575124Z harfbuzz conda-forge/linux-64::harfbuzz-11.0.0-h76408a6_0 2025-05-07T19:45:01.4575741Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:01.4576176Z icu conda-forge/linux-64::icu-75.1-he02047a_0 2025-05-07T19:45:01.4576565Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:01.4576978Z jinja2 
conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:01.4577397Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:01.4577828Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:01.4578215Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:01.4578619Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:01.4579086Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:01.4579577Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:01.4580016Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:01.4580539Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:01.4581031Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:01.4581477Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:01.4581912Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:01.4582391Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.4582888Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:01.4583406Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:01.4583887Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:01.4584373Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:01.4584819Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:01.4585261Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:45:01.4585759Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:01.4586225Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.4586714Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.4587224Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 
2025-05-07T19:45:01.4587667Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:01.4588167Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:01.4588653Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:01.4589127Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:01.4589623Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:01.4590089Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:01.4590544Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:01.4590966Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:01.4591384Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:01.4591838Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:01.4592283Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:01.4592708Z libzlib conda-forge/linux-64::libzlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.4593101Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:01.4593548Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py310h89163eb_1 2025-05-07T19:45:01.4593992Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:01.4594528Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:01.4595014Z openjdk conda-forge/linux-64::openjdk-23.0.2-h53dfc1b_2 2025-05-07T19:45:01.4595448Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:01.4596025Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:01.4596642Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:01.4597092Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:01.4597612Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:01.4598138Z pyelftools 
conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:01.4598645Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py310h89163eb_2 2025-05-07T19:45:01.4599086Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:01.4599521Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:01.4600093Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:01.4600600Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:01.4601159Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:01.4601677Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:01.4602165Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:01.4602682Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:01.4603177Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:01.4603700Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:01.4604223Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:01.4604776Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:01.4605333Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:01.4605852Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:01.4606382Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:01.4606945Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:01.4607494Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:01.4608009Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:01.4608639Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.4609130Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.4609549Z 
yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:01.4609958Z zstd conda-forge/linux-64::zstd-1.5.7-hb8e6e7a_2 2025-05-07T19:45:01.4610216Z 2025-05-07T19:45:01.4610353Z The following packages will be UPDATED: 2025-05-07T19:45:01.4610562Z 2025-05-07T19:45:01.4610845Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:01.4611495Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:01.4612147Z python pkgs/main::python-3.10.16-he870216_1 --> conda-forge::python-3.10.17-hd6af730_0_cpython 2025-05-07T19:45:01.4612810Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:01.4613479Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:01.4614081Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.4614721Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.4615064Z 2025-05-07T19:45:01.4615302Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:01.4615624Z 2025-05-07T19:45:01.4615855Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:01.4616210Z 2025-05-07T19:45:01.4616246Z 2025-05-07T19:45:01.4616249Z 2025-05-07T19:45:01.4616396Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:01.4616790Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:01.4617326Z bazel-7.5.0 | 47.4 MB | | 0% 2025-05-07T19:45:01.4617791Z python-3.10.17 | 23.9 MB | | 0% 2025-05-07T19:45:01.4623042Z cmake-4.0.2 | 19.4 MB | | 0% 2025-05-07T19:45:01.4636969Z icu-75.1 | 11.6 MB | | 0% 2025-05-07T19:45:01.4638569Z libgrpc-1.71.0 | 7.6 MB | | 0% 2025-05-07T19:45:01.4640181Z openblas-0.3.29 | 5.8 MB | | 0% 2025-05-07T19:45:01.4641833Z libopenblas-0.3.29 | 5.6 MB | | 0% 2025-05-07T19:45:01.4643047Z libcups-2.3.3 | 4.3 MB | | 0% 2025-05-07T19:45:01.4643607Z libglib-2.84.0 | 3.8 MB | | 0%
2025-05-07T19:45:01.4644215Z libprotobuf-5.29.3 | 3.2 MB | | 0% 2025-05-07T19:45:01.4644833Z tk-8.6.13 | 3.2 MB | | 0% 2025-05-07T19:45:01.4645610Z harfbuzz-11.0.0 | 1.6 MB | | 0% 2025-05-07T19:45:01.4646290Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0% 2025-05-07T19:45:01.4647328Z libgfortran5-15.1.0 | 1.5 MB | | 0%
2025-05-07T19:45:01.4648176Z krb5-1.21.3 | 1.3 MB | | 0% 2025-05-07T19:45:01.4649042Z libabseil-20250127.1 | 1.3 MB | | 0% 2025-05-07T19:45:01.4650143Z cairo-1.18.4 | 955 KB | | 0%
2025-05-07T19:45:01.4651108Z pcre2-10.44 | 934 KB | | 0%
2025-05-07T19:45:01.6128960Z ... (more hidden) ...
2025-05-07T19:45:01.7135923Z icu-75.1 | 11.6 MB | | 0%
2025-05-07T19:45:01.7975652Z icu-75.1 | 11.6 MB | | 1%
2025-05-07T19:45:01.8141817Z cmake-4.0.2 | 19.4 MB | | 0%
2025-05-07T19:45:01.8199454Z icu-75.1 | 11.6 MB | ###7 | 37%
2025-05-07T19:45:01.8345062Z python-3.10.17 | 23.9 MB | | 0%
2025-05-07T19:45:01.8521191Z openjdk-23.0.2 | 181.4 MB | | 0%
2025-05-07T19:45:01.8975145Z bazel-7.5.0 | 47.4 MB | | 0%
2025-05-07T19:45:01.9144163Z cmake-4.0.2 | 19.4 MB | #### | 41%
2025-05-07T19:45:01.9255389Z icu-75.1 | 11.6 MB | ########2 | 82%
2025-05-07T19:45:01.9348577Z python-3.10.17 | 23.9 MB | ##1 | 21%
2025-05-07T19:45:01.9523514Z openjdk-23.0.2 | 181.4 MB | 3 | 3%
2025-05-07T19:45:01.9975321Z bazel-7.5.0 | 47.4 MB | #3 | 14%
2025-05-07T19:45:02.0258838Z cmake-4.0.2 | 19.4 MB | #######6 | 76%
2025-05-07T19:45:02.0349017Z python-3.10.17 | 23.9 MB | ####6 | 46%
2025-05-07T19:45:02.0595062Z openjdk-23.0.2 | 181.4 MB | 6 | 7%
2025-05-07T19:45:02.0909210Z bazel-7.5.0 | 47.4 MB | ##6 | 26%
2025-05-07T19:45:02.1340602Z icu-75.1 | 11.6 MB | ########## | 100%
2025-05-07T19:45:02.1345139Z python-3.10.17 | 23.9 MB | ######8 | 68%
2025-05-07T19:45:02.1352418Z openjdk-23.0.2 | 181.4 MB | 9 | 10%
2025-05-07T19:45:02.1599428Z libgrpc-1.71.0 | 7.6 MB | | 0%
2025-05-07T19:45:02.1794760Z bazel-7.5.0 | 47.4 MB | ###7 | 37%
2025-05-07T19:45:02.2263103Z cmake-4.0.2 | 19.4 MB | ########## | 100%
2025-05-07T19:45:02.2356442Z openblas-0.3.29 | 5.8 MB | | 0%
2025-05-07T19:45:02.2402063Z libgrpc-1.71.0 | 7.6 MB | #####5 | 55%
2025-05-07T19:45:02.2425624Z python-3.10.17 | 23.9 MB | ########7 | 87%
2025-05-07T19:45:02.2617795Z openjdk-23.0.2 | 181.4 MB | #2 | 12%
2025-05-07T19:45:02.3264394Z bazel-7.5.0 | 47.4 MB | ####7 | 48%
2025-05-07T19:45:02.3512073Z openblas-0.3.29 | 5.8 MB | ######8 | 68%
2025-05-07T19:45:02.3711557Z openjdk-23.0.2 | 181.4 MB | #5 | 15%
2025-05-07T19:45:02.4500627Z bazel-7.5.0 | 47.4 MB | #####8 | 58%
2025-05-07T19:45:02.4501784Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%
2025-05-07T19:45:02.4515156Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%
2025-05-07T19:45:02.4546937Z openjdk-23.0.2 | 181.4 MB | #8 | 19%
2025-05-07T19:45:02.4714797Z openblas-0.3.29 | 5.8 MB | ########## | 100%
2025-05-07T19:45:02.5018160Z bazel-7.5.0 | 47.4 MB | #######4 | 75%
2025-05-07T19:45:02.5466822Z libopenblas-0.3.29 | 5.6 MB | | 0%
2025-05-07T19:45:02.5744743Z libcups-2.3.3 | 4.3 MB | | 0%
2025-05-07T19:45:02.5871840Z openjdk-23.0.2 | 181.4 MB | ##1 | 22%
2025-05-07T19:45:02.6018698Z bazel-7.5.0 | 47.4 MB | ########6 | 87%
2025-05-07T19:45:02.6469308Z libopenblas-0.3.29 | 5.6 MB | #######6 | 76%
2025-05-07T19:45:02.6881154Z libcups-2.3.3 | 4.3 MB | ########3 | 83%
2025-05-07T19:45:02.7069891Z openjdk-23.0.2 | 181.4 MB | ##4 | 24%
2025-05-07T19:45:02.7351398Z bazel-7.5.0 | 47.4 MB | #########8 | 98%
2025-05-07T19:45:02.7449275Z libcups-2.3.3 | 4.3 MB | ########## | 100%
2025-05-07T19:45:02.7779195Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%
2025-05-07T19:45:02.7814744Z python-3.10.17 | 23.9 MB | ########## | 100%
2025-05-07T19:45:02.7881908Z libprotobuf-5.29.3 | 3.2 MB | | 0%
2025-05-07T19:45:02.7955815Z openjdk-23.0.2 | 181.4 MB | ##7 | 28%
2025-05-07T19:45:02.8163726Z libglib-2.84.0 | 3.8 MB | | 0%
2025-05-07T19:45:02.8882995Z tk-8.6.13 | 3.2 MB | | 0%
2025-05-07T19:45:02.9005290Z openjdk-23.0.2 | 181.4 MB | ### | 31%
2025-05-07T19:45:02.9009047Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%
2025-05-07T19:45:02.9305693Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%
2025-05-07T19:45:02.9306367Z tk-8.6.13 | 3.2 MB | ########## | 100%
2025-05-07T19:45:02.9313334Z tk-8.6.13 | 3.2 MB | ########## | 100%
2025-05-07T19:45:02.9314040Z libglib-2.84.0 | 3.8 MB | ########## | 100%
2025-05-07T19:45:02.9518407Z libglib-2.84.0 | 3.8 MB | ########## | 100%
2025-05-07T19:45:02.9637067Z icu-75.1 | 11.6 MB | ########## | 100%
2025-05-07T19:45:02.9878189Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%
2025-05-07T19:45:03.0021425Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%
2025-05-07T19:45:03.0077021Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%
2025-05-07T19:45:03.0116233Z harfbuzz-11.0.0 | 1.6 MB | | 1%
2025-05-07T19:45:03.0278318Z openjdk-23.0.2 | 181.4 MB | ###3 | 34%
2025-05-07T19:45:03.0428958Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%
2025-05-07T19:45:03.0795853Z krb5-1.21.3 | 1.3 MB | 1 | 1%
2025-05-07T19:45:03.0825390Z krb5-1.21.3 | 1.3 MB | ########## | 100%
2025-05-07T19:45:03.1117444Z libabseil-20250127.1 | 1.3 MB | 1 | 1%
2025-05-07T19:45:03.1229354Z openjdk-23.0.2 | 181.4 MB | ###8 | 38%
2025-05-07T19:45:03.1231831Z cairo-1.18.4 | 955 KB | 1 | 2%
2025-05-07T19:45:03.1246374Z libabseil-20250127.1 | 1.3 MB | ########## | 100%
2025-05-07T19:45:03.1518170Z harfbuzz-11.0.0 | 1.6 MB | ####2 | 43%
2025-05-07T19:45:03.1607884Z cairo-1.18.4 | 955 KB | ########## | 100%
2025-05-07T19:45:03.1843468Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%
2025-05-07T19:45:03.1938627Z ... (more hidden) ...
2025-05-07T19:45:03.2118214Z pcre2-10.44 | 934 KB | 1 | 2%
2025-05-07T19:45:03.2147025Z openjdk-23.0.2 | 181.4 MB | ####2 | 42%
2025-05-07T19:45:03.2239910Z ... (more hidden) ...
2025-05-07T19:45:03.3231713Z pcre2-10.44 | 934 KB | ########## | 100%
2025-05-07T19:45:03.3486496Z openjdk-23.0.2 | 181.4 MB | ####5 | 46%
2025-05-07T19:45:03.4233653Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%
2025-05-07T19:45:03.4547583Z openjdk-23.0.2 | 181.4 MB | ####9 | 49%
2025-05-07T19:45:03.5326786Z bazel-7.5.0 | 47.4 MB | ########## | 100%
2025-05-07T19:45:03.5706346Z libcups-2.3.3 | 4.3 MB | ########## | 100%
2025-05-07T19:45:03.6050041Z openjdk-23.0.2 | 181.4 MB | #####2 | 53%
2025-05-07T19:45:03.6707360Z openblas-0.3.29 | 5.8 MB | ########## | 100%
2025-05-07T19:45:03.7707525Z openjdk-23.0.2 | 181.4 MB | #####6 | 57%
2025-05-07T19:45:03.8709403Z openjdk-23.0.2 | 181.4 MB | ###### | 60%
2025-05-07T19:45:03.8943518Z openjdk-23.0.2 | 181.4 MB | ######4 | 65%
2025-05-07T19:45:04.0019525Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%
2025-05-07T19:45:04.1027881Z openjdk-23.0.2 | 181.4 MB | ######8 | 68%
2025-05-07T19:45:04.2028586Z openjdk-23.0.2 | 181.4 MB | #######1 | 72%
2025-05-07T19:45:04.3103695Z openjdk-23.0.2 | 181.4 MB | #######5 | 75%
2025-05-07T19:45:04.4106644Z openjdk-23.0.2 | 181.4 MB | #######8 | 79%
2025-05-07T19:45:04.4547398Z openjdk-23.0.2 | 181.4 MB | ########3 | 83%
2025-05-07T19:45:04.5614158Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%
2025-05-07T19:45:04.6614050Z openjdk-23.0.2 | 181.4 MB | ########6 | 87%
2025-05-07T19:45:04.7613990Z openjdk-23.0.2 | 181.4 MB | #########1 | 91%
2025-05-07T19:45:04.8670923Z openjdk-23.0.2 | 181.4 MB | #########5 | 96%
2025-05-07T19:45:04.9224831Z openjdk-23.0.2 | 181.4 MB | #########9 | 99%
2025-05-07T19:45:05.2296229Z tk-8.6.13 | 3.2 MB | ########## | 100%
2025-05-07T19:45:05.3029862Z libglib-2.84.0 | 3.8 MB | ########## | 100%
2025-05-07T19:45:05.3030761Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%
2025-05-07T19:45:05.3918472Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%
2025-05-07T19:45:05.3919293Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%
2025-05-07T19:45:05.5514616Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%
2025-05-07T19:45:05.5515475Z krb5-1.21.3 | 1.3 MB | ########## | 100%
2025-05-07T19:45:05.6783145Z krb5-1.21.3 | 1.3 MB | ########## | 100%
2025-05-07T19:45:05.7826360Z python-3.10.17 | 23.9 MB | ########## | 100%
2025-05-07T19:45:05.7827051Z cairo-1.18.4 | 955 KB | ########## | 100%
2025-05-07T19:45:05.8121077Z cairo-1.18.4 | 955 KB | ########## | 100%
2025-05-07T19:45:05.8438351Z cmake-4.0.2 | 19.4 MB | ########## | 100%
2025-05-07T19:45:05.8439078Z ... (more hidden) ...
2025-05-07T19:45:05.9198134Z ... (more hidden) ...
2025-05-07T19:45:05.9198863Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%
2025-05-07T19:45:06.0728923Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%
2025-05-07T19:45:06.0732564Z libabseil-20250127.1 | 1.3 MB | ########## | 100%
2025-05-07T19:45:06.1356125Z libabseil-20250127.1 | 1.3 MB | ########## | 100%
2025-05-07T19:45:06.1357167Z pcre2-10.44 | 934 KB | ########## | 100%
2025-05-07T19:45:06.8999240Z pcre2-10.44 | 934 KB | ########## | 100%
2025-05-07T19:45:07.6223954Z openjdk-23.0.2 | 181.4 MB | ########## | 100%
2025-05-07T19:45:08.6454173Z bazel-7.5.0 | 47.4 MB | ########## | 100%
2025-05-07T19:45:08.6460407Z openjdk-23.0.2 | 181.4 MB | ########## | 100%
2025-05-07T19:45:08.6505769Z done
2025-05-07T19:45:08.9719251Z Preparing transaction: done
2025-05-07T19:45:12.5535053Z Verifying transaction: done
2025-05-07T19:45:15.1708499Z Executing transaction: done
2025-05-07T19:45:15.5907809Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ...
2025-05-07T19:45:17.4587228Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0
2025-05-07T19:45:17.4636626Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build
2025-05-07T19:45:19.7518764Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning.
2025-05-07T19:45:19.7520596Z Collecting build
2025-05-07T19:45:19.7520988Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB)
2025-05-07T19:45:19.7521873Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (25.0)
2025-05-07T19:45:19.7522929Z Collecting pyproject_hooks (from build)
2025-05-07T19:45:19.7523361Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB)
2025-05-07T19:45:19.7524175Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (2.2.1)
2025-05-07T19:45:19.7524899Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB)
2025-05-07T19:45:19.7525375Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB)
2025-05-07T19:45:19.7525814Z Installing collected packages: pyproject_hooks, build
2025-05-07T19:45:19.7526307Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0
2025-05-07T19:45:21.5801827Z /github/home/miniconda/envs/build_binary/bin/make
2025-05-07T19:45:21.6550701Z [CHECK] Binary make found in PATH
2025-05-07T19:45:23.4245406Z /github/home/miniconda/envs/build_binary/bin/cmake
2025-05-07T19:45:23.4821500Z [CHECK] Binary cmake found in PATH
2025-05-07T19:45:25.2502889Z /github/home/miniconda/envs/build_binary/bin/ninja
2025-05-07T19:45:25.3068165Z [CHECK] Binary ninja found in PATH
2025-05-07T19:45:27.2039544Z [CHECK] Python (sub-)package 'click' found ...
2025-05-07T19:45:29.1880168Z [CHECK] Python (sub-)package 'hypothesis' found ...
2025-05-07T19:45:31.0560259Z [CHECK] Python (sub-)package 'jinja2' found ...
2025-05-07T19:45:33.0269888Z [CHECK] Python (sub-)package 'skbuild' found ...
2025-05-07T19:45:34.8466214Z [CHECK] Python (sub-)package 'wheel' found ...
2025-05-07T19:45:34.8467429Z [INSTALL] Successfully installed all the build tools
2025-05-07T19:45:34.8534212Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3
2025-05-07T19:45:34.8534661Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3
2025-05-07T19:45:34.8535286Z shell: bash --noprofile --norc -e -o pipefail {0}
2025-05-07T19:45:34.8535622Z env:
2025-05-07T19:45:34.8535842Z PRELUDE: .github/scripts/setup_env.bash
2025-05-07T19:45:34.8536186Z BUILD_ENV: build_binary
2025-05-07T19:45:34.8536428Z BUILD_TARGET: default
2025-05-07T19:45:34.8536668Z BUILD_VARIANT: cuda
2025-05-07T19:45:34.8536897Z BUILD_CUDA_VERSION: 12.6.3
2025-05-07T19:45:34.8537179Z ##[endgroup]
2025-05-07T19:45:35.2618849Z ################################################################################
2025-05-07T19:45:35.2619273Z # Install CUDA
2025-05-07T19:45:35.2619554Z #
2025-05-07T19:45:35.2636170Z # [2025-05-07T19:45:35.262Z] + install_cuda build_binary 12.6.3
2025-05-07T19:45:35.2636668Z ################################################################################
2025-05-07T19:45:35.2651283Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null
2025-05-07T19:45:35.3471093Z [CHECK] Network does not appear to be blocked.
2025-05-07T19:45:35.3472210Z [SETUP] Cleaning up Conda packages ...
2025-05-07T19:45:35.3473157Z + conda clean --packages --tarball -y
2025-05-07T19:45:35.8701981Z Will remove 133 (485.9 MB) tarball(s).
2025-05-07T19:45:35.8702939Z Will remove 16 (77.4 MB) package(s).
2025-05-07T19:45:35.9273671Z + conda clean --all -y
2025-05-07T19:45:36.5277454Z There are no unused tarball(s) to remove.
2025-05-07T19:45:36.5277838Z Will remove 1 index cache(s).
2025-05-07T19:45:36.5278152Z There are no unused package(s) to remove.
2025-05-07T19:45:36.5278471Z There are no tempfile(s) to remove.
2025-05-07T19:45:36.5278782Z There are no logfile(s) to remove.
2025-05-07T19:45:36.5838912Z 2025-05-07T19:45:36.5848132Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:36.5875510Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:37.4089971Z Channels: 2025-05-07T19:45:37.4090665Z - conda-forge 2025-05-07T19:45:37.4091298Z Platform: linux-64 2025-05-07T19:45:47.1006020Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:48.5359376Z Solving environment: | / - \ done 2025-05-07T19:45:48.6624279Z 2025-05-07T19:45:48.6624625Z ## Package Plan ## 2025-05-07T19:45:48.6624843Z 2025-05-07T19:45:48.6625115Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:48.6625746Z 2025-05-07T19:45:48.6625945Z added / updated specs: 2025-05-07T19:45:48.6626454Z - cuda=12.6.3 2025-05-07T19:45:48.6626606Z 2025-05-07T19:45:48.6626612Z 2025-05-07T19:45:48.6626784Z The following packages will be downloaded: 2025-05-07T19:45:48.6627032Z 2025-05-07T19:45:48.6627221Z package | build 2025-05-07T19:45:48.6627614Z ---------------------------|----------------- 2025-05-07T19:45:48.6628007Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:45:48.6628661Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:45:48.6629408Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:45:48.6629893Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:45:48.6630400Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:45:48.6631348Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:45:48.6631988Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:45:48.6632519Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:45:48.6633480Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:45:48.6634022Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:48.6634525Z 
cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:48.6635106Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:45:48.6635678Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:48.6636373Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:45:48.6636947Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:45:48.6637505Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:45:48.6638043Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:45:48.6638547Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:45:48.6639082Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:45:48.6639592Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:48.6640165Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:45:48.6642016Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:45:48.6642537Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:48.6643098Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:48.6643611Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:45:48.6644116Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:45:48.6644622Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:45:48.6645181Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:45:48.6645712Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:45:48.6646219Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:45:48.6646947Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:45:48.6647453Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:45:48.6647972Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:45:48.6648463Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB 
conda-forge 2025-05-07T19:45:48.6648970Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:45:48.6649490Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:45:48.6649986Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:45:48.6650518Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:45:48.6651048Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:45:48.6651585Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:45:48.6652078Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:45:48.6652585Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:45:48.6653105Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:45:48.6653603Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:45:48.6654101Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:48.6654588Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:45:48.6655303Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:48.6655779Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:48.6656229Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:45:48.6656720Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:48.6657205Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:45:48.6657648Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:45:48.6658049Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:48.6658569Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:48.6658990Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:45:48.6659395Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:45:48.6659792Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:48.6660183Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:45:48.6660620Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB 
conda-forge 2025-05-07T19:45:48.6661179Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:45:48.6661644Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:45:48.6662094Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:45:48.6662597Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:45:48.6663070Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:45:48.6663573Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:45:48.6664058Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:45:48.6664580Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:45:48.6665106Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:45:48.6665607Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:45:48.6666143Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:45:48.6666640Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:45:48.6667146Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:45:48.6667591Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:45:48.6668059Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:45:48.6668538Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:45:48.6668991Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:45:48.6669476Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:45:48.6669964Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:45:48.6670673Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:45:48.6671205Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:45:48.6671699Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:45:48.6672213Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:45:48.6672706Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 
2025-05-07T19:45:48.6673211Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:45:48.6673763Z libxkbcommon-1.9.2 | h65c71a3_0 660 KB conda-forge 2025-05-07T19:45:48.6674266Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:45:48.6674754Z libxml2-2.13.8 | h4bc477f_0 675 KB conda-forge 2025-05-07T19:45:48.6675183Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:45:48.6675695Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:45:48.6676250Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:45:48.6676704Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:45:48.6677137Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:45:48.6677656Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:45:48.6678178Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:45:48.6678631Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:45:48.6679098Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:45:48.6679572Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:45:48.6680094Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:45:48.6680700Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:45:48.6681224Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:45:48.6681747Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:45:48.6682233Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:45:48.6682787Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:45:48.6683312Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:45:48.6683827Z ------------------------------------------------------------ 2025-05-07T19:45:48.6684235Z Total: 1.59 GB 2025-05-07T19:45:48.6684470Z 2025-05-07T19:45:48.6684616Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:48.6684898Z 2025-05-07T19:45:48.6685099Z attr 
conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:45:48.6685587Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:45:48.6686086Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:45:48.6686583Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:45:48.6687097Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:45:48.6687770Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:45:48.6688525Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:45:48.6689075Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:48.6689659Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:45:48.6690185Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:45:48.6690752Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:45:48.6691363Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:48.6691974Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:45:48.6692624Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:48.6693260Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:48.6693826Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6694440Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:45:48.6694948Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:45:48.6695508Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6696083Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:45:48.6696672Z cuda-driver-dev_l~ 
conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:48.6697246Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:45:48.6697737Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:45:48.6698339Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:45:48.6698926Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:45:48.6699415Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:45:48.6699975Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:45:48.6700542Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:45:48.6701119Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:45:48.6701768Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:45:48.6702320Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6702881Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6703392Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:45:48.6703942Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6704476Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:45:48.6704992Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:45:48.6705527Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:45:48.6706051Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:48.6706636Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:45:48.6707213Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:45:48.6707721Z cuda-nvvp 
conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:45:48.6708229Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:45:48.6708748Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:45:48.6709349Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:45:48.6709922Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:45:48.6710471Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:45:48.6711054Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:45:48.6711538Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:45:48.6712049Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:45:48.6712746Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:45:48.6713352Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:45:48.6713858Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:45:48.6714287Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:45:48.6714732Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:45:48.6715250Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:45:48.6715735Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:45:48.6716412Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:45:48.6716920Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:45:48.6717504Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:45:48.6718464Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:45:48.6719050Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:45:48.6719621Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:45:48.6720165Z 
libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:45:48.6720746Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:45:48.6721309Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:45:48.6721972Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:45:48.6723015Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:45:48.6723595Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:45:48.6724317Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:45:48.6724903Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:45:48.6725523Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:45:48.6726111Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:45:48.6726608Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:45:48.6727127Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:45:48.6728035Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:45:48.6728667Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:45:48.6729207Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:45:48.6729779Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:45:48.6730391Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:45:48.6730980Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:45:48.6731576Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:45:48.6732159Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:45:48.6732713Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:45:48.6733241Z 
libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:45:48.6733759Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.9.2-h65c71a3_0 2025-05-07T19:45:48.6734318Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:45:48.6734838Z libxml2 conda-forge/linux-64::libxml2-2.13.8-h4bc477f_0 2025-05-07T19:45:48.6735287Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:45:48.6735842Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:45:48.6736481Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:45:48.6737259Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:45:48.6737724Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:45:48.6738264Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:45:48.6738840Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:45:48.6739400Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:45:48.6739907Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:45:48.6740464Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:45:48.6741038Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:45:48.6741655Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:45:48.6742286Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:45:48.6742903Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:45:48.6743493Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:45:48.6744117Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:45:48.6744786Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:45:48.6745439Z 2025-05-07T19:45:48.6745479Z 
2025-05-07T19:45:48.6745698Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:48.6746153Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:45:48.6747063Z libcublas-12.6.4.1 | 256.2 MB | | 0% 2025-05-07T19:45:48.6747760Z libcufft-11.3.0.4 | 156.2 MB | | 0% 2025-05-07T19:45:48.6748326Z libcusparse-12.5.4.2 | 118.6 MB | | 0% 2025-05-07T19:45:48.6764920Z cuda-nsight-12.6.77 | 113.2 MB | | 0% 2025-05-07T19:45:48.6766119Z cuda-nvvp-12.6.80 | 109.3 MB | | 0% 2025-05-07T19:45:48.6767228Z libcusolver-11.7.1.2 | 95.8 MB | | 0% 2025-05-07T19:45:48.6768325Z libnpp-12.3.1.54 | 93.4 MB | | 0% 2025-05-07T19:45:48.6769519Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%
2025-05-07T19:45:48.6770748Z libcurand-10.3.7.77 | 39.9 MB | | 0% 2025-05-07T19:45:48.6771733Z gds-tools-1.11.1.6 | 37.8 MB | | 0% 2025-05-07T19:45:48.6773236Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0% 2025-05-07T19:45:48.6773955Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0% 2025-05-07T19:45:48.6774923Z libnvjitlink-12.6.85 | 14.9 MB | | 0%
2025-05-07T19:45:48.6776708Z cuda-nvcc-dev_linux- | 10.8 MB | | 0% 2025-05-07T19:45:48.6777437Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0% 2025-05-07T19:45:48.6782391Z cuda-sanitizer-api-1 | 8.9 MB | | 0% 2025-05-07T19:45:48.6783525Z cuda-nvvm-impl-12.6.
| 7.7 MB | | 0% 2025-05-07T19:45:48.6784849Z cuda-cupti-dev-12.6. | 3.4 MB | | 0% 2025-05-07T19:45:48.7721206Z ... (more hidden) ... 2025-05-07T19:45:48.7729897Z nsight-compute-2024. | 443.1 MB | | 1% 2025-05-07T19:45:48.7737849Z libcufft-11.3.0.4 | 156.2 MB | | 1% 2025-05-07T19:45:48.7756130Z libcusparse-12.5.4.2 | 118.6 MB | 1 | 1% 2025-05-07T19:45:48.7921278Z cuda-nsight-12.6.77 | 113.2 MB | 1 | 1% 2025-05-07T19:45:48.8721570Z libcublas-12.6.4.1 | 256.2 MB | | 0% 2025-05-07T19:45:48.8744727Z nsight-compute-2024.
| 443.1 MB | 2 | 2% 2025-05-07T19:45:48.8746966Z libcusparse-12.5.4.2 | 118.6 MB | 6 | 6% 2025-05-07T19:45:48.8771971Z libcufft-11.3.0.4 | 156.2 MB | 4 | 5% 2025-05-07T19:45:48.8921584Z cuda-nsight-12.6.77 | 113.2 MB | 5 | 5% 2025-05-07T19:45:48.9725282Z libcublas-12.6.4.1 | 256.2 MB | 3 | 3% 2025-05-07T19:45:48.9749813Z nsight-compute-2024. | 443.1 MB | 4 | 4% 2025-05-07T19:45:48.9772602Z libcufft-11.3.0.4 | 156.2 MB | 8 | 9% 2025-05-07T19:45:48.9885585Z cuda-nsight-12.6.77 | 113.2 MB | 9 | 10% 2025-05-07T19:45:48.9923373Z libcusparse-12.5.4.2 | 118.6 MB | 9 | 10% 2025-05-07T19:45:49.0756732Z libcublas-12.6.4.1 | 256.2 MB | 5 | 6% 2025-05-07T19:45:49.0775915Z libcufft-11.3.0.4 | 156.2 MB | #1 | 12% 2025-05-07T19:45:49.0797695Z cuda-nsight-12.6.77 | 113.2 MB | #4 | 14% 2025-05-07T19:45:49.0885984Z nsight-compute-2024.
| 443.1 MB | 5 | 5% 2025-05-07T19:45:49.0923473Z libcusparse-12.5.4.2 | 118.6 MB | #4 | 15% 2025-05-07T19:45:49.1756251Z libcublas-12.6.4.1 | 256.2 MB | 7 | 8% 2025-05-07T19:45:49.1775685Z libcufft-11.3.0.4 | 156.2 MB | #6 | 16% 2025-05-07T19:45:49.1798054Z cuda-nsight-12.6.77 | 113.2 MB | #9 | 19% 2025-05-07T19:45:49.1924184Z nsight-compute-2024. | 443.1 MB | 7 | 7% 2025-05-07T19:45:49.2778225Z libcublas-12.6.4.1 | 256.2 MB | # | 11% 2025-05-07T19:45:49.2778924Z libcufft-11.3.0.4 | 156.2 MB | #9 | 20% 2025-05-07T19:45:49.2797841Z cuda-nsight-12.6.77 | 113.2 MB | ##4 | 25% 2025-05-07T19:45:49.3004884Z nsight-compute-2024. | 443.1 MB | 8 | 9% 2025-05-07T19:45:49.3117715Z libcublas-12.6.4.1 | 256.2 MB | #2 | 13% 2025-05-07T19:45:49.3783028Z libcusparse-12.5.4.2 | 118.6 MB | #8 | 18% 2025-05-07T19:45:49.3797747Z cuda-nsight-12.6.77 | 113.2 MB | ##9 | 29% 2025-05-07T19:45:49.3815617Z nsight-compute-2024.
| 443.1 MB | #1 | 11% 2025-05-07T19:45:49.4108950Z libcufft-11.3.0.4 | 156.2 MB | ##3 | 23% 2025-05-07T19:45:49.4115616Z libcublas-12.6.4.1 | 256.2 MB | #5 | 15% 2025-05-07T19:45:49.4815764Z libcusparse-12.5.4.2 | 118.6 MB | ##3 | 24% 2025-05-07T19:45:49.5008365Z libcufft-11.3.0.4 | 156.2 MB | ##7 | 27% 2025-05-07T19:45:49.5093223Z cuda-nsight-12.6.77 | 113.2 MB | ###4 | 34% 2025-05-07T19:45:49.5108494Z nsight-compute-2024. | 443.1 MB | #2 | 13% 2025-05-07T19:45:49.5116182Z libcublas-12.6.4.1 | 256.2 MB | #7 | 18% 2025-05-07T19:45:49.5819848Z libcusparse-12.5.4.2 | 118.6 MB | ##9 | 29% 2025-05-07T19:45:49.6083439Z libcufft-11.3.0.4 | 156.2 MB | ###1 | 31% 2025-05-07T19:45:49.6110248Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 39% 2025-05-07T19:45:49.6117466Z libcublas-12.6.4.1 | 256.2 MB | ## | 20% 2025-05-07T19:45:49.6283391Z libcusparse-12.5.4.2 | 118.6 MB | ###4 | 34% 2025-05-07T19:45:49.6822835Z nsight-compute-2024.
| 443.1 MB | #4 | 15% 2025-05-07T19:45:49.6823140Z 2025-05-07T19:45:49.6823363Z 2025-05-07T19:45:49.7097556Z libcufft-11.3.0.4 | 156.2 MB | ###5 | 35%  2025-05-07T19:45:49.7097882Z 2025-05-07T19:45:49.7097888Z 2025-05-07T19:45:49.7097891Z 2025-05-07T19:45:49.7097895Z 2025-05-07T19:45:49.7114233Z cuda-nsight-12.6.77 | 113.2 MB | ####3 | 43%  2025-05-07T19:45:49.7114618Z 2025-05-07T19:45:49.7120259Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 22%  2025-05-07T19:45:49.7120541Z 2025-05-07T19:45:49.7120546Z 2025-05-07T19:45:49.7120907Z 2025-05-07T19:45:49.7402979Z libcusparse-12.5.4.2 | 118.6 MB | ###9 | 39%  2025-05-07T19:45:49.7843476Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:45:49.7843785Z 2025-05-07T19:45:49.7843790Z 2025-05-07T19:45:49.8103718Z libcufft-11.3.0.4 | 156.2 MB | ###8 | 39%  2025-05-07T19:45:49.8104021Z 2025-05-07T19:45:49.8104030Z 2025-05-07T19:45:49.8104034Z 2025-05-07T19:45:49.8104038Z 2025-05-07T19:45:49.8110259Z cuda-nsight-12.6.77 | 113.2 MB | ####8 | 48%  2025-05-07T19:45:49.8111365Z 2025-05-07T19:45:49.8121574Z libcublas-12.6.4.1 | 256.2 MB | ##4 | 25%  2025-05-07T19:45:49.8121855Z 2025-05-07T19:45:49.8121860Z 2025-05-07T19:45:49.8122312Z 2025-05-07T19:45:49.8524585Z libcusparse-12.5.4.2 | 118.6 MB | ####3 | 44%  2025-05-07T19:45:49.8854622Z nsight-compute-2024. | 443.1 MB | #7 | 18% 2025-05-07T19:45:49.8855082Z 2025-05-07T19:45:49.8855095Z 2025-05-07T19:45:49.9104406Z libcufft-11.3.0.4 | 156.2 MB | ####2 | 43%  2025-05-07T19:45:49.9104760Z 2025-05-07T19:45:49.9104766Z 2025-05-07T19:45:49.9104773Z 2025-05-07T19:45:49.9104781Z 2025-05-07T19:45:49.9111140Z cuda-nsight-12.6.77 | 113.2 MB | #####2 | 53%  2025-05-07T19:45:49.9111456Z 2025-05-07T19:45:49.9123392Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 27%  2025-05-07T19:45:49.9123660Z 2025-05-07T19:45:49.9123665Z 2025-05-07T19:45:49.9124496Z 2025-05-07T19:45:49.9622331Z libcusparse-12.5.4.2 | 118.6 MB | ####9 | 49%  2025-05-07T19:45:49.9856895Z nsight-compute-2024. 
| 443.1 MB | #9 | 19% 2025-05-07T19:45:49.9857377Z 2025-05-07T19:45:50.0105400Z 2025-05-07T19:45:50.0105867Z libcufft-11.3.0.4 | 156.2 MB | ####6 | 47%  2025-05-07T19:45:50.0106162Z 2025-05-07T19:45:50.0106167Z 2025-05-07T19:45:50.0106172Z 2025-05-07T19:45:50.0106178Z 2025-05-07T19:45:50.0125111Z cuda-nsight-12.6.77 | 113.2 MB | #####7 | 58%  2025-05-07T19:45:50.0125398Z 2025-05-07T19:45:50.0125402Z 2025-05-07T19:45:50.0125717Z 2025-05-07T19:45:50.0201647Z libcusparse-12.5.4.2 | 118.6 MB | #####4 | 54%  2025-05-07T19:45:50.0201956Z 2025-05-07T19:45:50.0681318Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 30%  2025-05-07T19:45:50.0877554Z nsight-compute-2024. | 443.1 MB | ## | 21% 2025-05-07T19:45:50.0877840Z 2025-05-07T19:45:50.0877845Z 2025-05-07T19:45:50.1107424Z libcufft-11.3.0.4 | 156.2 MB | ##### | 51%  2025-05-07T19:45:50.1107725Z 2025-05-07T19:45:50.1107732Z 2025-05-07T19:45:50.1107766Z 2025-05-07T19:45:50.1107772Z 2025-05-07T19:45:50.1129360Z cuda-nsight-12.6.77 | 113.2 MB | ######2 | 63%  2025-05-07T19:45:50.1129831Z 2025-05-07T19:45:50.1129865Z 2025-05-07T19:45:50.1129872Z 2025-05-07T19:45:50.1202620Z libcusparse-12.5.4.2 | 118.6 MB | #####9 | 60%  2025-05-07T19:45:50.1204471Z 2025-05-07T19:45:50.1681988Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 32%  2025-05-07T19:45:50.2129571Z nsight-compute-2024. | 443.1 MB | ##2 | 22% 2025-05-07T19:45:50.2130040Z 2025-05-07T19:45:50.2130080Z 2025-05-07T19:45:50.2130087Z 2025-05-07T19:45:50.2130436Z libcusparse-12.5.4.2 | 118.6 MB | ######6 | 66%  2025-05-07T19:45:50.2130718Z 2025-05-07T19:45:50.2131674Z 2025-05-07T19:45:50.2173818Z libcufft-11.3.0.4 | 156.2 MB | #####4 | 55%  2025-05-07T19:45:50.2174112Z 2025-05-07T19:45:50.2174120Z 2025-05-07T19:45:50.2174127Z 2025-05-07T19:45:50.2174131Z 2025-05-07T19:45:50.2202979Z cuda-nsight-12.6.77 | 113.2 MB | ######7 | 68%  2025-05-07T19:45:50.2203320Z 2025-05-07T19:45:50.2682689Z libcublas-12.6.4.1 | 256.2 MB | ###5 | 35%  2025-05-07T19:45:50.3161060Z nsight-compute-2024. 
| 443.1 MB | ##4 | 24% 2025-05-07T19:45:50.3161364Z 2025-05-07T19:45:50.3161370Z 2025-05-07T19:45:50.3161826Z 2025-05-07T19:45:50.3173020Z libcusparse-12.5.4.2 | 118.6 MB | #######1 | 72%  2025-05-07T19:45:50.3173342Z 2025-05-07T19:45:50.3173352Z 2025-05-07T19:45:50.3173356Z 2025-05-07T19:45:50.3176247Z 2025-05-07T19:45:50.3177970Z cuda-nsight-12.6.77 | 113.2 MB | #######2 | 73%  2025-05-07T19:45:50.3178313Z 2025-05-07T19:45:50.3178642Z 2025-05-07T19:45:50.3558685Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 58%  2025-05-07T19:45:50.3558996Z 2025-05-07T19:45:50.3830433Z libcublas-12.6.4.1 | 256.2 MB | ###7 | 38%  2025-05-07T19:45:50.4177901Z nsight-compute-2024. | 443.1 MB | ##5 | 26% 2025-05-07T19:45:50.4178346Z 2025-05-07T19:45:50.4178399Z 2025-05-07T19:45:50.4178409Z 2025-05-07T19:45:50.4178878Z libcusparse-12.5.4.2 | 118.6 MB | #######6 | 77%  2025-05-07T19:45:50.4179191Z 2025-05-07T19:45:50.4179198Z 2025-05-07T19:45:50.4333224Z libcufft-11.3.0.4 | 156.2 MB | ######1 | 62%  2025-05-07T19:45:50.4333513Z 2025-05-07T19:45:50.4333525Z 2025-05-07T19:45:50.4333530Z 2025-05-07T19:45:50.4333551Z 2025-05-07T19:45:50.4558553Z cuda-nsight-12.6.77 | 113.2 MB | #######7 | 78%  2025-05-07T19:45:50.4558868Z 2025-05-07T19:45:50.4913920Z libcublas-12.6.4.1 | 256.2 MB | #### | 40%  2025-05-07T19:45:50.5181644Z nsight-compute-2024. 
| 443.1 MB | ##7 | 27% 2025-05-07T19:45:50.5181936Z 2025-05-07T19:45:50.5181955Z 2025-05-07T19:45:50.5181959Z 2025-05-07T19:45:50.5316591Z libcusparse-12.5.4.2 | 118.6 MB | ########2 | 82%  2025-05-07T19:45:50.5317227Z 2025-05-07T19:45:50.5317237Z 2025-05-07T19:45:50.5348574Z libcufft-11.3.0.4 | 156.2 MB | ######5 | 66%  2025-05-07T19:45:50.5349145Z 2025-05-07T19:45:50.5349150Z 2025-05-07T19:45:50.5349153Z 2025-05-07T19:45:50.5349157Z 2025-05-07T19:45:50.5559450Z cuda-nsight-12.6.77 | 113.2 MB | ########2 | 82%  2025-05-07T19:45:50.5559781Z 2025-05-07T19:45:50.5914043Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 43%  2025-05-07T19:45:50.6197253Z nsight-compute-2024. | 443.1 MB | ##9 | 29% 2025-05-07T19:45:50.6197560Z 2025-05-07T19:45:50.6197566Z 2025-05-07T19:45:50.6197571Z 2025-05-07T19:45:50.6393424Z libcusparse-12.5.4.2 | 118.6 MB | ########7 | 88%  2025-05-07T19:45:50.6393762Z 2025-05-07T19:45:50.6393770Z 2025-05-07T19:45:50.6719174Z libcufft-11.3.0.4 | 156.2 MB | ######9 | 69%  2025-05-07T19:45:50.6719464Z 2025-05-07T19:45:50.6719469Z 2025-05-07T19:45:50.6719473Z 2025-05-07T19:45:50.6719695Z 2025-05-07T19:45:50.6899128Z cuda-nsight-12.6.77 | 113.2 MB | ########7 | 87%  2025-05-07T19:45:50.6899456Z 2025-05-07T19:45:50.6914872Z libcublas-12.6.4.1 | 256.2 MB | ####5 | 45%  2025-05-07T19:45:50.7603102Z nsight-compute-2024. 
| 443.1 MB | ###1 | 32% 2025-05-07T19:45:50.7603396Z 2025-05-07T19:45:50.7603416Z 2025-05-07T19:45:50.7603421Z 2025-05-07T19:45:50.7720437Z libcusparse-12.5.4.2 | 118.6 MB | #########3 | 93%  2025-05-07T19:45:50.7720753Z 2025-05-07T19:45:50.7720761Z 2025-05-07T19:45:50.7720765Z 2025-05-07T19:45:50.7721184Z 2025-05-07T19:45:50.7734212Z cuda-nsight-12.6.77 | 113.2 MB | #########2 | 92%  2025-05-07T19:45:50.7735150Z 2025-05-07T19:45:50.7735508Z 2025-05-07T19:45:50.7902228Z libcufft-11.3.0.4 | 156.2 MB | #######2 | 72%  2025-05-07T19:45:50.7902524Z 2025-05-07T19:45:50.7911003Z libcublas-12.6.4.1 | 256.2 MB | ####8 | 48%  2025-05-07T19:45:50.8720768Z nsight-compute-2024. | 443.1 MB | ###3 | 34% 2025-05-07T19:45:50.8721150Z 2025-05-07T19:45:50.8721157Z 2025-05-07T19:45:50.8721163Z 2025-05-07T19:45:50.8721171Z 2025-05-07T19:45:50.8733210Z cuda-nsight-12.6.77 | 113.2 MB | #########9 | 99%  2025-05-07T19:45:50.8733538Z 2025-05-07T19:45:50.8734498Z 2025-05-07T19:45:50.8903497Z libcufft-11.3.0.4 | 156.2 MB | #######5 | 76%  2025-05-07T19:45:50.8903779Z 2025-05-07T19:45:50.9033250Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 52%  2025-05-07T19:45:50.9738613Z nsight-compute-2024. | 443.1 MB | ###5 | 36% 2025-05-07T19:45:50.9739031Z 2025-05-07T19:45:50.9739069Z 2025-05-07T19:45:50.9985466Z libcufft-11.3.0.4 | 156.2 MB | #######9 | 80%  2025-05-07T19:45:50.9985765Z 2025-05-07T19:45:51.0177887Z libcublas-12.6.4.1 | 256.2 MB | #####5 | 56%  2025-05-07T19:45:51.0987443Z nsight-compute-2024. | 443.1 MB | ###7 | 37% 2025-05-07T19:45:51.0987754Z 2025-05-07T19:45:51.1178172Z libcublas-12.6.4.1 | 256.2 MB | #####9 | 60%  2025-05-07T19:45:51.2314130Z nsight-compute-2024. 
| 443.1 MB | #### | 40% 2025-05-07T19:45:51.2314504Z 2025-05-07T19:45:51.2551616Z libcublas-12.6.4.1 | 256.2 MB | ######3 | 63%  2025-05-07T19:45:51.2551923Z 2025-05-07T19:45:51.2552128Z 2025-05-07T19:45:51.2552138Z 2025-05-07T19:45:51.2553007Z libcusparse-12.5.4.2 | 118.6 MB | #########7 | 98%  2025-05-07T19:45:51.2848061Z nsight-compute-2024. | 443.1 MB | ####2 | 42% 2025-05-07T19:45:51.2848382Z 2025-05-07T19:45:51.2848388Z 2025-05-07T19:45:51.3567661Z libcufft-11.3.0.4 | 156.2 MB | ########3 | 83%  2025-05-07T19:45:51.3737922Z nsight-compute-2024. | 443.1 MB | ####5 | 45% 2025-05-07T19:45:51.3738474Z 2025-05-07T19:45:51.3738498Z 2025-05-07T19:45:51.3738507Z 2025-05-07T19:45:51.3738516Z 2025-05-07T19:45:51.3848529Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:51.3849058Z 2025-05-07T19:45:51.3849063Z 2025-05-07T19:45:51.4118882Z libcufft-11.3.0.4 | 156.2 MB | ########6 | 87%  2025-05-07T19:45:51.4119177Z 2025-05-07T19:45:51.4119183Z 2025-05-07T19:45:51.4119480Z 2025-05-07T19:45:51.4119485Z 2025-05-07T19:45:51.4119509Z 2025-05-07T19:45:51.4570543Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:51.4727774Z nsight-compute-2024. 
| 443.1 MB | ####7 | 47% 2025-05-07T19:45:51.4728093Z 2025-05-07T19:45:51.5119180Z libcublas-12.6.4.1 | 256.2 MB | ######6 | 67%  2025-05-07T19:45:51.5119504Z 2025-05-07T19:45:51.5119514Z 2025-05-07T19:45:51.5119519Z 2025-05-07T19:45:51.5119525Z 2025-05-07T19:45:51.5119530Z 2025-05-07T19:45:51.5158216Z cuda-nvvp-12.6.80 | 109.3 MB | 7 | 7%  2025-05-07T19:45:51.5158527Z 2025-05-07T19:45:51.5158538Z 2025-05-07T19:45:51.5735581Z libcufft-11.3.0.4 | 156.2 MB | ########9 | 90%  2025-05-07T19:45:51.5735942Z 2025-05-07T19:45:51.6122164Z libcublas-12.6.4.1 | 256.2 MB | ######9 | 69%  2025-05-07T19:45:51.6122479Z 2025-05-07T19:45:51.6122486Z 2025-05-07T19:45:51.6122491Z 2025-05-07T19:45:51.6122495Z 2025-05-07T19:45:51.6123476Z 2025-05-07T19:45:51.6163443Z cuda-nvvp-12.6.80 | 109.3 MB | #2 | 13%  2025-05-07T19:45:51.6163759Z 2025-05-07T19:45:51.6163770Z 2025-05-07T19:45:51.6618338Z libcufft-11.3.0.4 | 156.2 MB | #########3 | 93%  2025-05-07T19:45:51.6863542Z nsight-compute-2024. | 443.1 MB | ####9 | 50% 2025-05-07T19:45:51.6863886Z 2025-05-07T19:45:51.7124072Z libcublas-12.6.4.1 | 256.2 MB | #######2 | 72%  2025-05-07T19:45:51.7124540Z 2025-05-07T19:45:51.7124545Z 2025-05-07T19:45:51.7124548Z 2025-05-07T19:45:51.7124553Z 2025-05-07T19:45:51.7124556Z 2025-05-07T19:45:51.7162715Z cuda-nvvp-12.6.80 | 109.3 MB | #8 | 18%  2025-05-07T19:45:51.7163160Z 2025-05-07T19:45:51.7163165Z 2025-05-07T19:45:51.7696553Z libcufft-11.3.0.4 | 156.2 MB | #########7 | 97%  2025-05-07T19:45:51.8125412Z nsight-compute-2024. | 443.1 MB | #####1 | 51% 2025-05-07T19:45:51.8125935Z 2025-05-07T19:45:51.8125944Z 2025-05-07T19:45:51.8125954Z 2025-05-07T19:45:51.8126004Z 2025-05-07T19:45:51.8126009Z 2025-05-07T19:45:51.8701584Z cuda-nvvp-12.6.80 | 109.3 MB | ##5 | 25%  2025-05-07T19:45:51.9126479Z nsight-compute-2024. 
| 443.1 MB | #####3 | 54% 2025-05-07T19:45:51.9127034Z 2025-05-07T19:45:51.9127042Z 2025-05-07T19:45:51.9127047Z 2025-05-07T19:45:51.9127053Z 2025-05-07T19:45:51.9127058Z 2025-05-07T19:45:51.9803407Z cuda-nvvp-12.6.80 | 109.3 MB | ###3 | 34%  2025-05-07T19:45:52.0265554Z nsight-compute-2024. | 443.1 MB | #####5 | 56% 2025-05-07T19:45:52.0266045Z 2025-05-07T19:45:52.0266052Z 2025-05-07T19:45:52.0266056Z 2025-05-07T19:45:52.0266061Z 2025-05-07T19:45:52.0266323Z 2025-05-07T19:45:52.0374569Z cuda-nvvp-12.6.80 | 109.3 MB | ####3 | 44%  2025-05-07T19:45:52.0374987Z 2025-05-07T19:45:52.0869151Z libcublas-12.6.4.1 | 256.2 MB | #######4 | 75%  2025-05-07T19:45:52.1510227Z nsight-compute-2024. | 443.1 MB | #####7 | 58% 2025-05-07T19:45:52.1510623Z 2025-05-07T19:45:52.1595556Z libcublas-12.6.4.1 | 256.2 MB | #######6 | 77%  2025-05-07T19:45:52.1596047Z 2025-05-07T19:45:52.1596057Z 2025-05-07T19:45:52.1596063Z 2025-05-07T19:45:52.1596070Z 2025-05-07T19:45:52.1596079Z 2025-05-07T19:45:52.1741571Z cuda-nvvp-12.6.80 | 109.3 MB | #####1 | 51%  2025-05-07T19:45:52.1742043Z 2025-05-07T19:45:52.1742048Z 2025-05-07T19:45:52.1742052Z 2025-05-07T19:45:52.1742057Z 2025-05-07T19:45:52.2199151Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:52.2512715Z nsight-compute-2024. | 443.1 MB | #####9 | 60% 2025-05-07T19:45:52.2513190Z 2025-05-07T19:45:52.2726893Z libcublas-12.6.4.1 | 256.2 MB | #######8 | 79%  2025-05-07T19:45:52.2727187Z 2025-05-07T19:45:52.2727192Z 2025-05-07T19:45:52.2727196Z 2025-05-07T19:45:52.2727200Z 2025-05-07T19:45:52.2727209Z 2025-05-07T19:45:52.3024715Z cuda-nvvp-12.6.80 | 109.3 MB | #####7 | 58%  2025-05-07T19:45:52.3025340Z 2025-05-07T19:45:52.3025345Z 2025-05-07T19:45:52.3025348Z 2025-05-07T19:45:52.3199677Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:52.3502517Z nsight-compute-2024. 
| 443.1 MB | ######2 | 62% 2025-05-07T19:45:52.3502807Z 2025-05-07T19:45:52.3502813Z 2025-05-07T19:45:52.3502819Z 2025-05-07T19:45:52.3502823Z 2025-05-07T19:45:52.3502829Z 2025-05-07T19:45:52.3502834Z 2025-05-07T19:45:52.3508407Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:52.3508762Z 2025-05-07T19:45:52.3939853Z libcublas-12.6.4.1 | 256.2 MB | ######## | 81%  2025-05-07T19:45:52.3940433Z 2025-05-07T19:45:52.3940500Z 2025-05-07T19:45:52.3940507Z 2025-05-07T19:45:52.3940515Z 2025-05-07T19:45:52.3940523Z 2025-05-07T19:45:52.4513140Z cuda-nvvp-12.6.80 | 109.3 MB | ######4 | 64%  2025-05-07T19:45:52.4513489Z 2025-05-07T19:45:52.4643345Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 83%  2025-05-07T19:45:52.4643952Z 2025-05-07T19:45:52.4643959Z 2025-05-07T19:45:52.4643966Z 2025-05-07T19:45:52.4643974Z 2025-05-07T19:45:52.4643981Z 2025-05-07T19:45:52.4643987Z 2025-05-07T19:45:52.5034591Z libcusolver-11.7.1.2 | 95.8 MB | 2 | 3%  2025-05-07T19:45:52.5034974Z 2025-05-07T19:45:52.5034980Z 2025-05-07T19:45:52.5034984Z 2025-05-07T19:45:52.5034989Z 2025-05-07T19:45:52.5034992Z 2025-05-07T19:45:52.5219509Z cuda-nvvp-12.6.80 | 109.3 MB | ####### | 71%  2025-05-07T19:45:52.5773699Z nsight-compute-2024. | 443.1 MB | ######4 | 64% 2025-05-07T19:45:52.5774222Z 2025-05-07T19:45:52.5826232Z libcublas-12.6.4.1 | 256.2 MB | ########5 | 86%  2025-05-07T19:45:52.5826660Z 2025-05-07T19:45:52.5826666Z 2025-05-07T19:45:52.5826726Z 2025-05-07T19:45:52.5826738Z 2025-05-07T19:45:52.5826743Z 2025-05-07T19:45:52.5826748Z 2025-05-07T19:45:52.6074112Z libcusolver-11.7.1.2 | 95.8 MB | 5 | 5%  2025-05-07T19:45:52.6074554Z 2025-05-07T19:45:52.6074560Z 2025-05-07T19:45:52.6074565Z 2025-05-07T19:45:52.6074568Z 2025-05-07T19:45:52.6074572Z 2025-05-07T19:45:52.6221333Z cuda-nvvp-12.6.80 | 109.3 MB | #######6 | 77%  2025-05-07T19:45:52.6827231Z nsight-compute-2024. 
| 443.1 MB | ######6 | 66% 2025-05-07T19:45:52.6827550Z 2025-05-07T19:45:52.6827728Z 2025-05-07T19:45:52.6827760Z 2025-05-07T19:45:52.6827767Z 2025-05-07T19:45:52.6827848Z 2025-05-07T19:45:52.6827857Z 2025-05-07T19:45:52.6942552Z libcusolver-11.7.1.2 | 95.8 MB | 9 | 10%  2025-05-07T19:45:52.6942893Z 2025-05-07T19:45:52.7264796Z libcublas-12.6.4.1 | 256.2 MB | ########7 | 88%  2025-05-07T19:45:52.7265241Z 2025-05-07T19:45:52.7265247Z 2025-05-07T19:45:52.7265250Z 2025-05-07T19:45:52.7265254Z 2025-05-07T19:45:52.7265257Z 2025-05-07T19:45:52.7678636Z cuda-nvvp-12.6.80 | 109.3 MB | ########2 | 83%  2025-05-07T19:45:52.7829124Z nsight-compute-2024. | 443.1 MB | ######7 | 68% 2025-05-07T19:45:52.7830036Z 2025-05-07T19:45:52.7830058Z 2025-05-07T19:45:52.7830074Z 2025-05-07T19:45:52.7830089Z 2025-05-07T19:45:52.7830106Z 2025-05-07T19:45:52.7830125Z 2025-05-07T19:45:52.7945401Z libcusolver-11.7.1.2 | 95.8 MB | #5 | 16%  2025-05-07T19:45:52.7945734Z 2025-05-07T19:45:52.8441412Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 90%  2025-05-07T19:45:52.8441750Z 2025-05-07T19:45:52.8441756Z 2025-05-07T19:45:52.8441761Z 2025-05-07T19:45:52.8441766Z 2025-05-07T19:45:52.8441770Z 2025-05-07T19:45:52.8831865Z cuda-nvvp-12.6.80 | 109.3 MB | ########8 | 88%  2025-05-07T19:45:52.8832256Z 2025-05-07T19:45:52.8832313Z 2025-05-07T19:45:52.8832319Z 2025-05-07T19:45:52.8832325Z 2025-05-07T19:45:52.8832330Z 2025-05-07T19:45:52.8832336Z 2025-05-07T19:45:52.8945399Z libcusolver-11.7.1.2 | 95.8 MB | ##2 | 23%  2025-05-07T19:45:52.8945763Z 2025-05-07T19:45:52.8963413Z libcublas-12.6.4.1 | 256.2 MB | #########1 | 92%  2025-05-07T19:45:52.9456898Z nsight-compute-2024. 
| 443.1 MB | ######9 | 70% 2025-05-07T19:45:52.9457209Z 2025-05-07T19:45:52.9457404Z 2025-05-07T19:45:52.9457412Z 2025-05-07T19:45:52.9457418Z 2025-05-07T19:45:52.9457423Z 2025-05-07T19:45:52.9835662Z cuda-nvvp-12.6.80 | 109.3 MB | #########3 | 94%  2025-05-07T19:45:52.9836085Z 2025-05-07T19:45:52.9836091Z 2025-05-07T19:45:52.9836098Z 2025-05-07T19:45:52.9836103Z 2025-05-07T19:45:52.9836130Z 2025-05-07T19:45:52.9836133Z 2025-05-07T19:45:52.9945472Z libcusolver-11.7.1.2 | 95.8 MB | ##9 | 29%  2025-05-07T19:45:52.9945927Z 2025-05-07T19:45:53.0151619Z libcublas-12.6.4.1 | 256.2 MB | #########4 | 94%  2025-05-07T19:45:53.0186532Z nsight-compute-2024. | 443.1 MB | #######1 | 71% 2025-05-07T19:45:53.0187068Z 2025-05-07T19:45:53.0187077Z 2025-05-07T19:45:53.0482636Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:53.0482944Z 2025-05-07T19:45:53.0482977Z 2025-05-07T19:45:53.0482982Z 2025-05-07T19:45:53.0482986Z 2025-05-07T19:45:53.0482990Z 2025-05-07T19:45:53.0538010Z cuda-nvvp-12.6.80 | 109.3 MB | #########9 | 99%  2025-05-07T19:45:53.0538445Z 2025-05-07T19:45:53.0538455Z 2025-05-07T19:45:53.0538463Z 2025-05-07T19:45:53.0538470Z 2025-05-07T19:45:53.0538475Z 2025-05-07T19:45:53.0538482Z 2025-05-07T19:45:53.0538491Z 2025-05-07T19:45:53.0836361Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:53.0836879Z 2025-05-07T19:45:53.0836886Z 2025-05-07T19:45:53.0836890Z 2025-05-07T19:45:53.0836893Z 2025-05-07T19:45:53.0836897Z 2025-05-07T19:45:53.0836929Z 2025-05-07T19:45:53.1109344Z libcusolver-11.7.1.2 | 95.8 MB | ###5 | 35%  2025-05-07T19:45:53.1109810Z 2025-05-07T19:45:53.1419348Z libcublas-12.6.4.1 | 256.2 MB | #########6 | 96%  2025-05-07T19:45:53.1542983Z nsight-compute-2024. 
| 443.1 MB | #######2 | 73% 2025-05-07T19:45:53.1543400Z 2025-05-07T19:45:53.1543597Z 2025-05-07T19:45:53.1543606Z 2025-05-07T19:45:53.1543612Z 2025-05-07T19:45:53.1543617Z 2025-05-07T19:45:53.1543623Z 2025-05-07T19:45:53.1543630Z 2025-05-07T19:45:53.1837967Z libnpp-12.3.1.54 | 93.4 MB | 4 | 5%  2025-05-07T19:45:53.1838321Z 2025-05-07T19:45:53.1838327Z 2025-05-07T19:45:53.1838332Z 2025-05-07T19:45:53.1838336Z 2025-05-07T19:45:53.1838343Z 2025-05-07T19:45:53.1838349Z 2025-05-07T19:45:53.2200614Z libcusolver-11.7.1.2 | 95.8 MB | #### | 41%  2025-05-07T19:45:53.2200981Z 2025-05-07T19:45:53.2545746Z libcublas-12.6.4.1 | 256.2 MB | #########8 | 98%  2025-05-07T19:45:53.2546315Z 2025-05-07T19:45:53.2546321Z 2025-05-07T19:45:53.2546324Z 2025-05-07T19:45:53.2546329Z 2025-05-07T19:45:53.2546333Z 2025-05-07T19:45:53.2546366Z 2025-05-07T19:45:53.2546370Z 2025-05-07T19:45:53.2704337Z libnpp-12.3.1.54 | 93.4 MB | # | 11%  2025-05-07T19:45:53.2909595Z nsight-compute-2024. | 443.1 MB | #######3 | 74% 2025-05-07T19:45:53.2910050Z 2025-05-07T19:45:53.2910061Z 2025-05-07T19:45:53.2910073Z 2025-05-07T19:45:53.2910080Z 2025-05-07T19:45:53.2910088Z 2025-05-07T19:45:53.2910098Z 2025-05-07T19:45:53.3547573Z libcusolver-11.7.1.2 | 95.8 MB | ####6 | 47%  2025-05-07T19:45:53.3548025Z 2025-05-07T19:45:53.3548030Z 2025-05-07T19:45:53.3548036Z 2025-05-07T19:45:53.3548042Z 2025-05-07T19:45:53.3548047Z 2025-05-07T19:45:53.3548052Z 2025-05-07T19:45:53.3548058Z 2025-05-07T19:45:53.3773064Z libnpp-12.3.1.54 | 93.4 MB | #7 | 17%  2025-05-07T19:45:53.3912107Z nsight-compute-2024. 
| 443.1 MB | #######5 | 75% 2025-05-07T19:45:53.3912582Z 2025-05-07T19:45:53.3912591Z 2025-05-07T19:45:53.3912597Z 2025-05-07T19:45:53.3912601Z 2025-05-07T19:45:53.3912607Z 2025-05-07T19:45:53.3912612Z 2025-05-07T19:45:53.4549701Z libcusolver-11.7.1.2 | 95.8 MB | #####3 | 54%  2025-05-07T19:45:53.4550371Z 2025-05-07T19:45:53.4550377Z 2025-05-07T19:45:53.4550382Z 2025-05-07T19:45:53.4550387Z 2025-05-07T19:45:53.4550392Z 2025-05-07T19:45:53.4550396Z 2025-05-07T19:45:53.4550401Z 2025-05-07T19:45:53.4773274Z libnpp-12.3.1.54 | 93.4 MB | ##4 | 25%  2025-05-07T19:45:53.4913099Z nsight-compute-2024. | 443.1 MB | #######6 | 77% 2025-05-07T19:45:53.4913610Z 2025-05-07T19:45:53.4913621Z 2025-05-07T19:45:53.4913631Z 2025-05-07T19:45:53.4913639Z 2025-05-07T19:45:53.4913647Z 2025-05-07T19:45:53.4913657Z 2025-05-07T19:45:53.5550244Z libcusolver-11.7.1.2 | 95.8 MB | ###### | 61%  2025-05-07T19:45:53.5550711Z 2025-05-07T19:45:53.5550716Z 2025-05-07T19:45:53.5550721Z 2025-05-07T19:45:53.5550725Z 2025-05-07T19:45:53.5550730Z 2025-05-07T19:45:53.5550733Z 2025-05-07T19:45:53.5550738Z 2025-05-07T19:45:53.5774394Z libnpp-12.3.1.54 | 93.4 MB | ###2 | 32%  2025-05-07T19:45:53.5913715Z nsight-compute-2024. | 443.1 MB | #######7 | 78% 2025-05-07T19:45:53.5914193Z 2025-05-07T19:45:53.5914203Z 2025-05-07T19:45:53.5914209Z 2025-05-07T19:45:53.5914223Z 2025-05-07T19:45:53.5914228Z 2025-05-07T19:45:53.5914234Z 2025-05-07T19:45:53.6551507Z libcusolver-11.7.1.2 | 95.8 MB | ######8 | 68%  2025-05-07T19:45:53.6551976Z 2025-05-07T19:45:53.6552006Z 2025-05-07T19:45:53.6552011Z 2025-05-07T19:45:53.6552016Z 2025-05-07T19:45:53.6552021Z 2025-05-07T19:45:53.6552025Z 2025-05-07T19:45:53.6552030Z 2025-05-07T19:45:53.6775534Z libnpp-12.3.1.54 | 93.4 MB | ###9 | 39%  2025-05-07T19:45:53.6915413Z nsight-compute-2024. 
| 443.1 MB | #######9 | 79% 2025-05-07T19:45:53.6915771Z 2025-05-07T19:45:53.6916020Z 2025-05-07T19:45:53.6916030Z 2025-05-07T19:45:53.6916035Z 2025-05-07T19:45:53.6916039Z 2025-05-07T19:45:53.6916044Z 2025-05-07T19:45:53.7553768Z libcusolver-11.7.1.2 | 95.8 MB | #######6 | 76%  2025-05-07T19:45:53.7554178Z 2025-05-07T19:45:53.7554184Z 2025-05-07T19:45:53.7554188Z 2025-05-07T19:45:53.7554192Z 2025-05-07T19:45:53.7554195Z 2025-05-07T19:45:53.7554199Z 2025-05-07T19:45:53.7554202Z 2025-05-07T19:45:53.7782053Z libnpp-12.3.1.54 | 93.4 MB | ####7 | 47%  2025-05-07T19:45:53.7916421Z nsight-compute-2024. | 443.1 MB | ######## | 81% 2025-05-07T19:45:53.7916990Z 2025-05-07T19:45:53.7917000Z 2025-05-07T19:45:53.7917008Z 2025-05-07T19:45:53.7917012Z 2025-05-07T19:45:53.7917017Z 2025-05-07T19:45:53.7917023Z 2025-05-07T19:45:53.8555421Z libcusolver-11.7.1.2 | 95.8 MB | ########3 | 84%  2025-05-07T19:45:53.8556135Z 2025-05-07T19:45:53.8556145Z 2025-05-07T19:45:53.8556150Z 2025-05-07T19:45:53.8556153Z 2025-05-07T19:45:53.8556157Z 2025-05-07T19:45:53.8556160Z 2025-05-07T19:45:53.8556165Z 2025-05-07T19:45:53.8781897Z libnpp-12.3.1.54 | 93.4 MB | #####4 | 54%  2025-05-07T19:45:53.8919162Z nsight-compute-2024. | 443.1 MB | ########2 | 82% 2025-05-07T19:45:53.8919505Z 2025-05-07T19:45:53.8919511Z 2025-05-07T19:45:53.8919516Z 2025-05-07T19:45:53.8919520Z 2025-05-07T19:45:53.8919524Z 2025-05-07T19:45:53.8919598Z 2025-05-07T19:45:53.9556765Z libcusolver-11.7.1.2 | 95.8 MB | ######### | 91%  2025-05-07T19:45:53.9557141Z 2025-05-07T19:45:53.9557146Z 2025-05-07T19:45:53.9557149Z 2025-05-07T19:45:53.9557154Z 2025-05-07T19:45:53.9557159Z 2025-05-07T19:45:53.9557164Z 2025-05-07T19:45:53.9557168Z 2025-05-07T19:45:53.9787789Z libnpp-12.3.1.54 | 93.4 MB | ######1 | 62%  2025-05-07T19:45:53.9946849Z nsight-compute-2024. 
| 443.1 MB | ########3 | 84% 2025-05-07T19:45:53.9947185Z 2025-05-07T19:45:53.9947190Z 2025-05-07T19:45:53.9947194Z 2025-05-07T19:45:53.9947199Z 2025-05-07T19:45:53.9947226Z 2025-05-07T19:45:53.9947231Z 2025-05-07T19:45:54.0557949Z libcusolver-11.7.1.2 | 95.8 MB | #########8 | 98%  2025-05-07T19:45:54.0558629Z 2025-05-07T19:45:54.0558633Z 2025-05-07T19:45:54.0558637Z 2025-05-07T19:45:54.0558641Z 2025-05-07T19:45:54.0558644Z 2025-05-07T19:45:54.0558648Z 2025-05-07T19:45:54.0558651Z 2025-05-07T19:45:54.0787944Z libnpp-12.3.1.54 | 93.4 MB | ####### | 70%  2025-05-07T19:45:54.1558188Z nsight-compute-2024. | 443.1 MB | ########5 | 86% 2025-05-07T19:45:54.1558520Z 2025-05-07T19:45:54.1558527Z 2025-05-07T19:45:54.1558533Z 2025-05-07T19:45:54.1558552Z 2025-05-07T19:45:54.1558560Z 2025-05-07T19:45:54.1558585Z 2025-05-07T19:45:54.1558589Z 2025-05-07T19:45:54.1789206Z libnpp-12.3.1.54 | 93.4 MB | #######9 | 79%  2025-05-07T19:45:54.2558680Z nsight-compute-2024. | 443.1 MB | ########7 | 87% 2025-05-07T19:45:54.2559025Z 2025-05-07T19:45:54.2559031Z 2025-05-07T19:45:54.2559036Z 2025-05-07T19:45:54.2559059Z 2025-05-07T19:45:54.2559063Z 2025-05-07T19:45:54.2559069Z 2025-05-07T19:45:54.2559072Z 2025-05-07T19:45:54.2790194Z libnpp-12.3.1.54 | 93.4 MB | ########8 | 89%  2025-05-07T19:45:54.3558960Z nsight-compute-2024. | 443.1 MB | ########9 | 89% 2025-05-07T19:45:54.3559283Z 2025-05-07T19:45:54.3559290Z 2025-05-07T19:45:54.3559296Z 2025-05-07T19:45:54.3559302Z 2025-05-07T19:45:54.3559307Z 2025-05-07T19:45:54.3559334Z 2025-05-07T19:45:54.3559340Z 2025-05-07T19:45:54.3791042Z libnpp-12.3.1.54 | 93.4 MB | #########7 | 98%  2025-05-07T19:45:54.4791482Z nsight-compute-2024. | 443.1 MB | #########1 | 91% 2025-05-07T19:45:54.5792765Z nsight-compute-2024. | 443.1 MB | #########4 | 94% 2025-05-07T19:45:54.6543904Z nsight-compute-2024. 
| 443.1 MB | #########6 | 97% 2025-05-07T19:45:54.6544232Z 2025-05-07T19:45:54.6544238Z 2025-05-07T19:45:54.6544242Z 2025-05-07T19:45:54.6544247Z 2025-05-07T19:45:54.6544251Z 2025-05-07T19:45:54.6794324Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:54.7123478Z nsight-compute-2024. | 443.1 MB | #########9 | 100% 2025-05-07T19:45:54.7123819Z 2025-05-07T19:45:54.7123824Z 2025-05-07T19:45:54.7123828Z 2025-05-07T19:45:54.7123832Z 2025-05-07T19:45:54.7123835Z 2025-05-07T19:45:54.7123839Z 2025-05-07T19:45:54.7123859Z 2025-05-07T19:45:54.7123864Z 2025-05-07T19:45:54.8126737Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:54.8127131Z 2025-05-07T19:45:54.8127161Z 2025-05-07T19:45:54.8127167Z 2025-05-07T19:45:54.8127173Z 2025-05-07T19:45:54.8127179Z 2025-05-07T19:45:54.8127185Z 2025-05-07T19:45:54.8127190Z 2025-05-07T19:45:54.8127196Z 2025-05-07T19:45:54.9152989Z cuda-nvdisasm-12.6.7 | 47.6 MB | #6 | 16%  2025-05-07T19:45:54.9153463Z 2025-05-07T19:45:54.9153471Z 2025-05-07T19:45:54.9153475Z 2025-05-07T19:45:54.9153480Z 2025-05-07T19:45:54.9153484Z 2025-05-07T19:45:54.9153487Z 2025-05-07T19:45:54.9153490Z 2025-05-07T19:45:54.9153496Z 2025-05-07T19:45:55.0161290Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##9 | 29%  2025-05-07T19:45:55.0161709Z 2025-05-07T19:45:55.0161714Z 2025-05-07T19:45:55.0161717Z 2025-05-07T19:45:55.0161721Z 2025-05-07T19:45:55.0161724Z 2025-05-07T19:45:55.0161745Z 2025-05-07T19:45:55.0161750Z 2025-05-07T19:45:55.0161754Z 2025-05-07T19:45:55.1161768Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####5 | 46%  2025-05-07T19:45:55.1162267Z 2025-05-07T19:45:55.1162273Z 2025-05-07T19:45:55.1162279Z 2025-05-07T19:45:55.1162285Z 2025-05-07T19:45:55.1162290Z 2025-05-07T19:45:55.1162320Z 2025-05-07T19:45:55.1162325Z 2025-05-07T19:45:55.1162332Z 2025-05-07T19:45:55.1238252Z cuda-nvdisasm-12.6.7 | 47.6 MB | ######6 | 66%  2025-05-07T19:45:55.1238685Z 2025-05-07T19:45:55.1238691Z 2025-05-07T19:45:55.1238698Z 2025-05-07T19:45:55.1438665Z libcusparse-12.5.4.2 | 
118.6 MB | ########## | 100%  2025-05-07T19:45:55.1438984Z 2025-05-07T19:45:55.1438989Z 2025-05-07T19:45:55.1438994Z 2025-05-07T19:45:55.1439249Z 2025-05-07T19:45:55.1439253Z 2025-05-07T19:45:55.1439256Z 2025-05-07T19:45:55.1846914Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:55.1847503Z 2025-05-07T19:45:55.1847513Z 2025-05-07T19:45:55.1847522Z 2025-05-07T19:45:55.1847530Z 2025-05-07T19:45:55.1847538Z 2025-05-07T19:45:55.1847546Z 2025-05-07T19:45:55.1847555Z 2025-05-07T19:45:55.1847564Z 2025-05-07T19:45:55.1847571Z 2025-05-07T19:45:55.2598838Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:55.2599267Z 2025-05-07T19:45:55.2599277Z 2025-05-07T19:45:55.2599285Z 2025-05-07T19:45:55.2599318Z 2025-05-07T19:45:55.2599327Z 2025-05-07T19:45:55.2599375Z 2025-05-07T19:45:55.2599383Z 2025-05-07T19:45:55.2599389Z 2025-05-07T19:45:55.2859745Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########2 | 82%  2025-05-07T19:45:55.2860194Z 2025-05-07T19:45:55.2860199Z 2025-05-07T19:45:55.2860204Z 2025-05-07T19:45:55.2860209Z 2025-05-07T19:45:55.2860252Z 2025-05-07T19:45:55.2860258Z 2025-05-07T19:45:55.2860264Z 2025-05-07T19:45:55.2860271Z 2025-05-07T19:45:55.2860279Z 2025-05-07T19:45:55.3599041Z libcurand-10.3.7.77 | 39.9 MB | #3 | 13%  2025-05-07T19:45:55.3599400Z 2025-05-07T19:45:55.3599405Z 2025-05-07T19:45:55.3599410Z 2025-05-07T19:45:55.3599415Z 2025-05-07T19:45:55.3599418Z 2025-05-07T19:45:55.3599423Z 2025-05-07T19:45:55.3599428Z 2025-05-07T19:45:55.3599432Z 2025-05-07T19:45:55.3864001Z cuda-nvdisasm-12.6.7 | 47.6 MB | #########8 | 98%  2025-05-07T19:45:55.3864383Z 2025-05-07T19:45:55.3864388Z 2025-05-07T19:45:55.3864393Z 2025-05-07T19:45:55.3864438Z 2025-05-07T19:45:55.3864442Z 2025-05-07T19:45:55.3864446Z 2025-05-07T19:45:55.3864449Z 2025-05-07T19:45:55.3864453Z 2025-05-07T19:45:55.3864457Z 2025-05-07T19:45:55.4541764Z libcurand-10.3.7.77 | 39.9 MB | ###4 | 34%  2025-05-07T19:45:55.4542186Z 2025-05-07T19:45:55.4542239Z 2025-05-07T19:45:55.4542243Z 
2025-05-07T19:45:55.4542246Z 2025-05-07T19:45:55.4542250Z 2025-05-07T19:45:55.4542254Z 2025-05-07T19:45:55.4542259Z 2025-05-07T19:45:55.4864278Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:45:55.4864646Z 2025-05-07T19:45:55.4864651Z 2025-05-07T19:45:55.4864655Z 2025-05-07T19:45:55.4864659Z 2025-05-07T19:45:55.4864662Z 2025-05-07T19:45:55.4864665Z 2025-05-07T19:45:55.4864670Z 2025-05-07T19:45:55.4864675Z 2025-05-07T19:45:55.4864679Z 2025-05-07T19:45:55.4996862Z libcurand-10.3.7.77 | 39.9 MB | ####7 | 48%  2025-05-07T19:45:55.4997214Z 2025-05-07T19:45:55.4997480Z 2025-05-07T19:45:55.4997486Z 2025-05-07T19:45:55.4997490Z 2025-05-07T19:45:55.4997495Z 2025-05-07T19:45:55.4997500Z 2025-05-07T19:45:55.4997504Z 2025-05-07T19:45:55.4997508Z 2025-05-07T19:45:55.4997513Z 2025-05-07T19:45:55.4997517Z 2025-05-07T19:45:55.5879730Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:55.5880146Z 2025-05-07T19:45:55.5880152Z 2025-05-07T19:45:55.5880157Z 2025-05-07T19:45:55.5880162Z 2025-05-07T19:45:55.5880167Z 2025-05-07T19:45:55.5880172Z 2025-05-07T19:45:55.5880201Z 2025-05-07T19:45:55.5880206Z 2025-05-07T19:45:55.5880211Z 2025-05-07T19:45:55.5996858Z libcurand-10.3.7.77 | 39.9 MB | ######5 | 65%  2025-05-07T19:45:55.5997320Z 2025-05-07T19:45:55.5997374Z 2025-05-07T19:45:55.5997380Z 2025-05-07T19:45:55.5997387Z 2025-05-07T19:45:55.5997422Z 2025-05-07T19:45:55.5997430Z 2025-05-07T19:45:55.5997436Z 2025-05-07T19:45:55.5997450Z 2025-05-07T19:45:55.5997456Z 2025-05-07T19:45:55.5997501Z 2025-05-07T19:45:55.6998000Z gds-tools-1.11.1.6 | 37.8 MB | ##3 | 24%  2025-05-07T19:45:55.6998357Z 2025-05-07T19:45:55.6998363Z 2025-05-07T19:45:55.6998392Z 2025-05-07T19:45:55.6998398Z 2025-05-07T19:45:55.6998406Z 2025-05-07T19:45:55.6998409Z 2025-05-07T19:45:55.6998690Z 2025-05-07T19:45:55.6998694Z 2025-05-07T19:45:55.6998697Z 2025-05-07T19:45:55.6998702Z 2025-05-07T19:45:55.7112609Z gds-tools-1.11.1.6 | 37.8 MB | ####7 | 48%  2025-05-07T19:45:55.7113050Z 
2025-05-07T19:45:55.7113056Z 2025-05-07T19:45:55.7113061Z 2025-05-07T19:45:55.7113090Z 2025-05-07T19:45:55.7113095Z 2025-05-07T19:45:55.7113100Z 2025-05-07T19:45:55.7113105Z 2025-05-07T19:45:55.7113109Z 2025-05-07T19:45:55.7113113Z 2025-05-07T19:45:55.8042253Z libcurand-10.3.7.77 | 39.9 MB | ######## | 80%  2025-05-07T19:45:55.8042633Z 2025-05-07T19:45:55.8042640Z 2025-05-07T19:45:55.8042647Z 2025-05-07T19:45:55.8042746Z 2025-05-07T19:45:55.8042751Z 2025-05-07T19:45:55.8042754Z 2025-05-07T19:45:55.8042759Z 2025-05-07T19:45:55.8042764Z 2025-05-07T19:45:55.8042768Z 2025-05-07T19:45:55.8042773Z 2025-05-07T19:45:55.8644533Z gds-tools-1.11.1.6 | 37.8 MB | ######6 | 66%  2025-05-07T19:45:55.8644955Z 2025-05-07T19:45:55.8644995Z 2025-05-07T19:45:55.9719052Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:55.9719405Z 2025-05-07T19:45:55.9719412Z 2025-05-07T19:45:55.9719418Z 2025-05-07T19:45:55.9719423Z 2025-05-07T19:45:55.9719430Z 2025-05-07T19:45:55.9719436Z 2025-05-07T19:45:55.9719468Z 2025-05-07T19:45:55.9719473Z 2025-05-07T19:45:56.0212737Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:56.0213118Z 2025-05-07T19:45:56.0213124Z 2025-05-07T19:45:56.0213132Z 2025-05-07T19:45:56.0213138Z 2025-05-07T19:45:56.0213145Z 2025-05-07T19:45:56.0213150Z 2025-05-07T19:45:56.0213156Z 2025-05-07T19:45:56.0213235Z 2025-05-07T19:45:56.0213239Z 2025-05-07T19:45:56.0213242Z 2025-05-07T19:45:56.0213246Z 2025-05-07T19:45:56.1119299Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:56.1119696Z 2025-05-07T19:45:56.1119702Z 2025-05-07T19:45:56.1119707Z 2025-05-07T19:45:56.1119757Z 2025-05-07T19:45:56.1119784Z 2025-05-07T19:45:56.1119787Z 2025-05-07T19:45:56.1119792Z 2025-05-07T19:45:56.1119797Z 2025-05-07T19:45:56.1119801Z 2025-05-07T19:45:56.1119806Z 2025-05-07T19:45:56.1216977Z gds-tools-1.11.1.6 | 37.8 MB | ########4 | 84%  2025-05-07T19:45:56.1217368Z 2025-05-07T19:45:56.1217398Z 2025-05-07T19:45:56.1217403Z 2025-05-07T19:45:56.1217408Z 
2025-05-07T19:45:56.2128065Z cuda-nvcc-tools-12.6 | 23.0 MB | ##9 | 29%
2025-05-07T19:45:56.2217297Z gds-tools-1.11.1.6 | 37.8 MB | #########9 | 99%
2025-05-07T19:45:56.2423123Z cuda-nvcc-tools-12.6 | 23.0 MB | #####4 | 55%
2025-05-07T19:45:56.2423880Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:56.2785039Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:56.3219550Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%
2025-05-07T19:45:56.3785550Z cuda-nvcc-tools-12.6 | 23.0 MB | #########6 | 96%
2025-05-07T19:45:56.5928912Z cuda-nvrtc-12.6.85 | 17.3 MB | #####8 | 58%
2025-05-07T19:45:56.6118712Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%
2025-05-07T19:45:56.6273423Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%
2025-05-07T19:45:56.6308047Z libnvjitlink-12.6.85 | 14.9 MB | | 0%
2025-05-07T19:45:56.6308790Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:45:56.6519557Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:45:56.6711986Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:45:56.6923972Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%
2025-05-07T19:45:56.6929411Z cuda-sanitizer-api-1 | 8.9 MB | | 0%
2025-05-07T19:45:56.6940135Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%
2025-05-07T19:45:56.7271985Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%
2025-05-07T19:45:56.7715110Z libnvjitlink-12.6.85 | 14.9 MB | ######8 | 68%
2025-05-07T19:45:56.7946558Z cuda-nvvm-tools-12.6 | 10.4 MB | ######2 | 62%
2025-05-07T19:45:56.7988355Z cuda-nvcc-dev_linux- | 10.8 MB | ####3 | 43%
2025-05-07T19:45:56.9480609Z cuda-sanitizer-api-1 | 8.9 MB | ####9 | 49%
2025-05-07T19:45:56.9481548Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:45:56.9679845Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:45:56.9680696Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:45:56.9755385Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:45:56.9812421Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:45:56.9813314Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:45:56.9838821Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:45:57.0187144Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%
2025-05-07T19:45:57.0545484Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%
2025-05-07T19:45:57.1044733Z ... (more hidden) ...
2025-05-07T19:45:57.1106776Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:45:57.1333456Z ... (more hidden) ...
2025-05-07T19:45:57.1353904Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%
2025-05-07T19:45:57.1354930Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:45:57.2924332Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:45:57.5556375Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%
2025-05-07T19:45:57.7236864Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%
2025-05-07T19:45:57.7785342Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%
2025-05-07T19:45:58.0064856Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%
2025-05-07T19:45:58.0411675Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%
2025-05-07T19:45:58.2002183Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%
2025-05-07T19:45:58.2883576Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%
2025-05-07T19:45:58.4325131Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%
2025-05-07T19:45:58.4621113Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%
2025-05-07T19:45:58.5312781Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%
2025-05-07T19:45:58.5313780Z ... (more hidden) ...
2025-05-07T19:45:58.5580392Z ... (more hidden) ...
2025-05-07T19:45:58.5729293Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:45:58.5730848Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:45:58.6567673Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%
2025-05-07T19:46:00.3417330Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%
2025-05-07T19:46:02.7528674Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%
2025-05-07T19:46:02.7534541Z nsight-compute-2024. | 443.1 MB | ########## | 100%
2025-05-07T19:46:02.7554854Z 2025-05-07T19:46:02.7554863Z 2025-05-07T19:46:02.7554866Z 2025-05-07T19:46:02.7555016Z  2025-05-07T19:46:02.7555143Z 2025-05-07T19:46:02.7555146Z 2025-05-07T19:46:02.7555150Z 2025-05-07T19:46:02.7555153Z 2025-05-07T19:46:02.7555273Z  2025-05-07T19:46:02.7555432Z 2025-05-07T19:46:02.7555435Z 2025-05-07T19:46:02.7555439Z 2025-05-07T19:46:02.7555442Z 2025-05-07T19:46:02.7555446Z 2025-05-07T19:46:02.7555568Z  2025-05-07T19:46:02.7555706Z 2025-05-07T19:46:02.7555740Z 2025-05-07T19:46:02.7555744Z 2025-05-07T19:46:02.7555747Z 2025-05-07T19:46:02.7555751Z 2025-05-07T19:46:02.7555754Z 2025-05-07T19:46:02.7555959Z  2025-05-07T19:46:02.7556114Z 2025-05-07T19:46:02.7556118Z 2025-05-07T19:46:02.7556121Z 2025-05-07T19:46:02.7556125Z 2025-05-07T19:46:02.7556128Z 2025-05-07T19:46:02.7556132Z 2025-05-07T19:46:02.7556166Z 2025-05-07T19:46:02.7556299Z  2025-05-07T19:46:02.7556455Z 2025-05-07T19:46:02.7556458Z 2025-05-07T19:46:02.7556542Z 2025-05-07T19:46:02.7556545Z 2025-05-07T19:46:02.7556549Z 2025-05-07T19:46:02.7556552Z 2025-05-07T19:46:02.7556556Z 2025-05-07T19:46:02.7556559Z 2025-05-07T19:46:02.7556721Z  2025-05-07T19:46:02.7556890Z 2025-05-07T19:46:02.7556893Z 2025-05-07T19:46:02.7556897Z 2025-05-07T19:46:02.7556900Z 2025-05-07T19:46:02.7556904Z 2025-05-07T19:46:02.7556907Z 2025-05-07T19:46:02.7556911Z 2025-05-07T19:46:02.7556914Z 2025-05-07T19:46:02.7556918Z 2025-05-07T19:46:02.7557085Z  2025-05-07T19:46:02.7557261Z 2025-05-07T19:46:02.7557265Z 2025-05-07T19:46:02.7557268Z 2025-05-07T19:46:02.7557271Z 2025-05-07T19:46:02.7557275Z 2025-05-07T19:46:02.7557282Z 2025-05-07T19:46:02.7557286Z 2025-05-07T19:46:02.7557289Z 2025-05-07T19:46:02.7557293Z 2025-05-07T19:46:02.7557296Z 2025-05-07T19:46:02.7557468Z  2025-05-07T19:46:02.7557656Z 2025-05-07T19:46:02.7557660Z 2025-05-07T19:46:02.7557663Z 2025-05-07T19:46:02.7557667Z 2025-05-07T19:46:02.7557675Z 2025-05-07T19:46:02.7557678Z 2025-05-07T19:46:02.7557682Z 2025-05-07T19:46:02.7557685Z 
2025-05-07T19:46:02.7557689Z 2025-05-07T19:46:02.7557692Z 2025-05-07T19:46:02.7557695Z 2025-05-07T19:46:02.7557876Z  2025-05-07T19:46:02.7558068Z 2025-05-07T19:46:02.7558072Z 2025-05-07T19:46:02.7558075Z 2025-05-07T19:46:02.7558079Z 2025-05-07T19:46:02.7558082Z 2025-05-07T19:46:02.7558085Z 2025-05-07T19:46:02.7558089Z 2025-05-07T19:46:02.7558092Z 2025-05-07T19:46:02.7558095Z 2025-05-07T19:46:02.7558099Z 2025-05-07T19:46:02.7558102Z 2025-05-07T19:46:02.7558105Z 2025-05-07T19:46:02.7558279Z  2025-05-07T19:46:02.7558478Z 2025-05-07T19:46:02.7558482Z 2025-05-07T19:46:02.7558485Z 2025-05-07T19:46:02.7558488Z 2025-05-07T19:46:02.7558492Z 2025-05-07T19:46:02.7558495Z 2025-05-07T19:46:02.7558498Z 2025-05-07T19:46:02.7558502Z 2025-05-07T19:46:02.7558505Z 2025-05-07T19:46:02.7558508Z 2025-05-07T19:46:02.7558515Z 2025-05-07T19:46:02.7558518Z 2025-05-07T19:46:02.7558522Z 2025-05-07T19:46:02.7558694Z  2025-05-07T19:46:02.7558901Z 2025-05-07T19:46:02.7558904Z 2025-05-07T19:46:02.7558908Z 2025-05-07T19:46:02.7558911Z 2025-05-07T19:46:02.7558914Z 2025-05-07T19:46:02.7558918Z 2025-05-07T19:46:02.7558921Z 2025-05-07T19:46:02.7558925Z 2025-05-07T19:46:02.7558928Z 2025-05-07T19:46:02.7558931Z 2025-05-07T19:46:02.7558935Z 2025-05-07T19:46:02.7558938Z 2025-05-07T19:46:02.7558969Z 2025-05-07T19:46:02.7558973Z 2025-05-07T19:46:02.7559220Z  2025-05-07T19:46:02.7559431Z 2025-05-07T19:46:02.7559435Z 2025-05-07T19:46:02.7559491Z 2025-05-07T19:46:02.7559495Z 2025-05-07T19:46:02.7559498Z 2025-05-07T19:46:02.7559502Z 2025-05-07T19:46:02.7559505Z 2025-05-07T19:46:02.7559509Z 2025-05-07T19:46:02.7559538Z 2025-05-07T19:46:02.7559542Z 2025-05-07T19:46:02.7559545Z 2025-05-07T19:46:02.7559549Z 2025-05-07T19:46:02.7559552Z 2025-05-07T19:46:02.7559560Z 2025-05-07T19:46:02.7559563Z 2025-05-07T19:46:02.7559721Z  2025-05-07T19:46:02.7559945Z 2025-05-07T19:46:02.7559949Z 2025-05-07T19:46:02.7559953Z 2025-05-07T19:46:02.7559987Z 2025-05-07T19:46:02.7559991Z 2025-05-07T19:46:02.7559994Z 
2025-05-07T19:46:02.7559998Z 2025-05-07T19:46:02.7560001Z 2025-05-07T19:46:02.7560005Z 2025-05-07T19:46:02.7560008Z 2025-05-07T19:46:02.7560011Z 2025-05-07T19:46:02.7560015Z 2025-05-07T19:46:02.7560018Z 2025-05-07T19:46:02.7560021Z 2025-05-07T19:46:02.7560025Z 2025-05-07T19:46:02.7560028Z 2025-05-07T19:46:02.7560194Z  2025-05-07T19:46:02.7560453Z 2025-05-07T19:46:02.7560457Z 2025-05-07T19:46:02.7560460Z 2025-05-07T19:46:02.7560464Z 2025-05-07T19:46:02.7560467Z 2025-05-07T19:46:02.7560470Z 2025-05-07T19:46:02.7560474Z 2025-05-07T19:46:02.7560477Z 2025-05-07T19:46:02.7560481Z 2025-05-07T19:46:02.7560484Z 2025-05-07T19:46:02.7560487Z 2025-05-07T19:46:02.7560540Z 2025-05-07T19:46:02.7560544Z 2025-05-07T19:46:02.7560547Z 2025-05-07T19:46:02.7560550Z 2025-05-07T19:46:02.7560554Z 2025-05-07T19:46:02.7560557Z 2025-05-07T19:46:02.7560758Z  2025-05-07T19:46:02.7560987Z 2025-05-07T19:46:02.7560990Z 2025-05-07T19:46:02.7560993Z 2025-05-07T19:46:02.7560997Z 2025-05-07T19:46:02.7561000Z 2025-05-07T19:46:02.7561003Z 2025-05-07T19:46:02.7561007Z 2025-05-07T19:46:02.7561010Z 2025-05-07T19:46:02.7561014Z 2025-05-07T19:46:02.7561017Z 2025-05-07T19:46:02.7561021Z 2025-05-07T19:46:02.7561024Z 2025-05-07T19:46:02.7561027Z 2025-05-07T19:46:02.7561031Z 2025-05-07T19:46:02.7561038Z 2025-05-07T19:46:02.7561069Z 2025-05-07T19:46:02.7561073Z 2025-05-07T19:46:02.7561076Z 2025-05-07T19:46:02.7561259Z  2025-05-07T19:46:02.7561502Z 2025-05-07T19:46:02.7561506Z 2025-05-07T19:46:02.7561643Z  2025-05-07T19:46:02.7561763Z 2025-05-07T19:46:02.7561770Z 2025-05-07T19:46:02.7561881Z  2025-05-07T19:46:02.7562001Z 2025-05-07T19:46:02.7562004Z 2025-05-07T19:46:02.7562034Z 2025-05-07T19:46:02.7562146Z  2025-05-07T19:46:02.7562270Z 2025-05-07T19:46:02.7562274Z 2025-05-07T19:46:02.7562278Z 2025-05-07T19:46:02.7562281Z 2025-05-07T19:46:02.7562400Z  2025-05-07T19:46:02.7562561Z 2025-05-07T19:46:02.7562565Z 2025-05-07T19:46:02.7562568Z 2025-05-07T19:46:02.7562572Z 2025-05-07T19:46:02.7562575Z 
2025-05-07T19:46:02.7562695Z  2025-05-07T19:46:02.7562862Z 2025-05-07T19:46:02.7562866Z 2025-05-07T19:46:02.7562869Z 2025-05-07T19:46:02.7562873Z 2025-05-07T19:46:02.7562876Z 2025-05-07T19:46:02.7562883Z 2025-05-07T19:46:02.7563007Z  2025-05-07T19:46:02.7563152Z 2025-05-07T19:46:02.7563156Z 2025-05-07T19:46:02.7563160Z 2025-05-07T19:46:02.7563163Z 2025-05-07T19:46:02.7563167Z 2025-05-07T19:46:02.7563203Z 2025-05-07T19:46:02.7563206Z 2025-05-07T19:46:02.7563339Z  2025-05-07T19:46:02.7563496Z 2025-05-07T19:46:02.7563500Z 2025-05-07T19:46:02.7563503Z 2025-05-07T19:46:02.7563507Z 2025-05-07T19:46:02.7563510Z 2025-05-07T19:46:02.7563513Z 2025-05-07T19:46:02.7563517Z 2025-05-07T19:46:02.7563520Z 2025-05-07T19:46:02.7563682Z  2025-05-07T19:46:02.7563848Z 2025-05-07T19:46:02.7563852Z 2025-05-07T19:46:02.7563855Z 2025-05-07T19:46:02.7563859Z 2025-05-07T19:46:02.7563862Z 2025-05-07T19:46:02.7563865Z 2025-05-07T19:46:02.7563869Z 2025-05-07T19:46:02.7563872Z 2025-05-07T19:46:02.7563875Z 2025-05-07T19:46:02.7564049Z  2025-05-07T19:46:02.7564222Z 2025-05-07T19:46:02.7564225Z 2025-05-07T19:46:02.7564277Z 2025-05-07T19:46:02.7564282Z 2025-05-07T19:46:02.7564285Z 2025-05-07T19:46:02.7564289Z 2025-05-07T19:46:02.7564292Z 2025-05-07T19:46:02.7564295Z 2025-05-07T19:46:02.7564299Z 2025-05-07T19:46:02.7564302Z 2025-05-07T19:46:02.7564467Z  2025-05-07T19:46:02.7564648Z 2025-05-07T19:46:02.7564655Z 2025-05-07T19:46:02.7564659Z 2025-05-07T19:46:02.7564662Z 2025-05-07T19:46:02.7564665Z 2025-05-07T19:46:02.7564669Z 2025-05-07T19:46:02.7564672Z 2025-05-07T19:46:02.7564676Z 2025-05-07T19:46:02.7564679Z 2025-05-07T19:46:02.7564683Z 2025-05-07T19:46:02.7564686Z 2025-05-07T19:46:02.7564853Z  2025-05-07T19:46:02.7565047Z 2025-05-07T19:46:02.7565050Z 2025-05-07T19:46:02.7565054Z 2025-05-07T19:46:02.7565057Z 2025-05-07T19:46:02.7565061Z 2025-05-07T19:46:02.7565065Z 2025-05-07T19:46:02.7565068Z 2025-05-07T19:46:02.7565071Z 2025-05-07T19:46:02.7565075Z 2025-05-07T19:46:02.7565078Z 
2025-05-07T19:46:02.7565082Z 2025-05-07T19:46:02.7565088Z 2025-05-07T19:46:02.7565265Z  2025-05-07T19:46:02.7565461Z 2025-05-07T19:46:02.7565464Z 2025-05-07T19:46:02.7565468Z 2025-05-07T19:46:02.7565471Z 2025-05-07T19:46:02.7565475Z 2025-05-07T19:46:02.7565478Z 2025-05-07T19:46:02.7565481Z 2025-05-07T19:46:02.7565485Z 2025-05-07T19:46:02.7565538Z 2025-05-07T19:46:02.7565542Z 2025-05-07T19:46:02.7565546Z 2025-05-07T19:46:02.7565549Z 2025-05-07T19:46:02.7565553Z 2025-05-07T19:46:02.7565734Z  2025-05-07T19:46:02.7565943Z 2025-05-07T19:46:02.7565947Z 2025-05-07T19:46:02.7565950Z 2025-05-07T19:46:02.7565953Z 2025-05-07T19:46:02.7565957Z 2025-05-07T19:46:02.7565960Z 2025-05-07T19:46:02.7565963Z 2025-05-07T19:46:02.7565967Z 2025-05-07T19:46:02.7565970Z 2025-05-07T19:46:02.7565974Z 2025-05-07T19:46:02.7565977Z 2025-05-07T19:46:02.7566007Z 2025-05-07T19:46:02.7566011Z 2025-05-07T19:46:02.7566014Z 2025-05-07T19:46:02.7566174Z  2025-05-07T19:46:02.7566390Z 2025-05-07T19:46:02.7566394Z 2025-05-07T19:46:02.7566397Z 2025-05-07T19:46:02.7566400Z 2025-05-07T19:46:02.7566404Z 2025-05-07T19:46:02.7566407Z 2025-05-07T19:46:02.7566411Z 2025-05-07T19:46:02.7566445Z 2025-05-07T19:46:02.7566448Z 2025-05-07T19:46:02.7566452Z 2025-05-07T19:46:02.7566455Z 2025-05-07T19:46:02.7566462Z 2025-05-07T19:46:02.7566465Z 2025-05-07T19:46:02.7566469Z 2025-05-07T19:46:02.7566472Z 2025-05-07T19:46:02.7566636Z  2025-05-07T19:46:02.7566851Z 2025-05-07T19:46:02.7566854Z 2025-05-07T19:46:02.7566858Z 2025-05-07T19:46:02.7566888Z 2025-05-07T19:46:02.7566892Z 2025-05-07T19:46:02.7566895Z 2025-05-07T19:46:02.7566898Z 2025-05-07T19:46:02.7566902Z 2025-05-07T19:46:02.7566905Z 2025-05-07T19:46:02.7566908Z 2025-05-07T19:46:02.7566912Z 2025-05-07T19:46:02.7566915Z 2025-05-07T19:46:02.7566919Z 2025-05-07T19:46:02.7566922Z 2025-05-07T19:46:02.7566925Z 2025-05-07T19:46:02.7566929Z 2025-05-07T19:46:02.7567097Z  2025-05-07T19:46:02.7567348Z 2025-05-07T19:46:02.7567352Z 2025-05-07T19:46:02.7567355Z 
2025-05-07T19:46:02.7567359Z 2025-05-07T19:46:02.7567362Z 2025-05-07T19:46:02.7567365Z 2025-05-07T19:46:02.7567369Z 2025-05-07T19:46:02.7567372Z 2025-05-07T19:46:02.7567375Z 2025-05-07T19:46:02.7567382Z 2025-05-07T19:46:02.7567386Z 2025-05-07T19:46:02.7567389Z 2025-05-07T19:46:02.7567393Z 2025-05-07T19:46:02.7567396Z 2025-05-07T19:46:02.7567400Z 2025-05-07T19:46:02.7567403Z 2025-05-07T19:46:02.7567407Z 2025-05-07T19:46:02.7567612Z  2025-05-07T19:46:02.7567838Z 2025-05-07T19:46:02.7567842Z 2025-05-07T19:46:02.7567845Z 2025-05-07T19:46:02.7567849Z 2025-05-07T19:46:02.7567852Z 2025-05-07T19:46:02.7567856Z 2025-05-07T19:46:02.7567859Z 2025-05-07T19:46:02.7567862Z 2025-05-07T19:46:02.7567866Z 2025-05-07T19:46:02.7567869Z 2025-05-07T19:46:02.7567872Z 2025-05-07T19:46:02.7567876Z 2025-05-07T19:46:02.7567927Z 2025-05-07T19:46:02.7567931Z 2025-05-07T19:46:02.7567935Z 2025-05-07T19:46:02.7567966Z 2025-05-07T19:46:02.7567969Z 2025-05-07T19:46:02.7567973Z 2025-05-07T19:46:02.7568153Z  2025-05-07T19:46:02.7568386Z 2025-05-07T19:46:02.7568389Z 2025-05-07T19:46:02.7568535Z  2025-05-07T19:46:02.7568657Z 2025-05-07T19:46:02.7568661Z 2025-05-07T19:46:02.7568846Z  2025-05-07T19:46:02.7568969Z 2025-05-07T19:46:02.7568973Z 2025-05-07T19:46:02.7568976Z 2025-05-07T19:46:02.7569089Z  2025-05-07T19:46:02.7569240Z 2025-05-07T19:46:02.7569243Z 2025-05-07T19:46:02.7569247Z 2025-05-07T19:46:02.7569250Z 2025-05-07T19:46:02.7569367Z  2025-05-07T19:46:02.7569498Z 2025-05-07T19:46:02.7569502Z 2025-05-07T19:46:02.7569505Z 2025-05-07T19:46:02.7569509Z 2025-05-07T19:46:02.7569537Z 2025-05-07T19:46:02.7569659Z  2025-05-07T19:46:02.7569827Z 2025-05-07T19:46:02.7569831Z 2025-05-07T19:46:02.7569834Z 2025-05-07T19:46:02.7569841Z 2025-05-07T19:46:02.7569844Z 2025-05-07T19:46:02.7569848Z 2025-05-07T19:46:02.7569975Z  2025-05-07T19:46:02.7570119Z 2025-05-07T19:46:02.7570149Z 2025-05-07T19:46:02.7570152Z 2025-05-07T19:46:02.7570156Z 2025-05-07T19:46:02.7570159Z 2025-05-07T19:46:02.7570162Z 
2025-05-07T19:46:02.7570231Z 2025-05-07T19:46:02.7570370Z  2025-05-07T19:46:02.7570526Z 2025-05-07T19:46:02.7570529Z 2025-05-07T19:46:02.7570533Z 2025-05-07T19:46:02.7570536Z 2025-05-07T19:46:02.7570539Z 2025-05-07T19:46:02.7570571Z 2025-05-07T19:46:02.7570575Z 2025-05-07T19:46:02.7570578Z 2025-05-07T19:46:02.7570710Z  2025-05-07T19:46:02.7570876Z 2025-05-07T19:46:02.7570879Z 2025-05-07T19:46:02.7570883Z 2025-05-07T19:46:02.7570886Z 2025-05-07T19:46:02.7570890Z 2025-05-07T19:46:02.7570893Z 2025-05-07T19:46:02.7570896Z 2025-05-07T19:46:02.7570900Z 2025-05-07T19:46:02.7570929Z 2025-05-07T19:46:02.7571064Z  2025-05-07T19:46:02.7571239Z 2025-05-07T19:46:02.7571243Z 2025-05-07T19:46:02.7571247Z 2025-05-07T19:46:02.7571250Z 2025-05-07T19:46:02.7571253Z 2025-05-07T19:46:02.7571257Z 2025-05-07T19:46:02.7571260Z 2025-05-07T19:46:02.7571264Z 2025-05-07T19:46:02.7571267Z 2025-05-07T19:46:02.7571270Z 2025-05-07T19:46:02.7571443Z  2025-05-07T19:46:02.7571628Z 2025-05-07T19:46:02.7571631Z 2025-05-07T19:46:02.7571635Z 2025-05-07T19:46:02.7571638Z 2025-05-07T19:46:02.7571642Z 2025-05-07T19:46:02.7571645Z 2025-05-07T19:46:02.7571649Z 2025-05-07T19:46:02.7571652Z 2025-05-07T19:46:02.7571655Z 2025-05-07T19:46:02.7571659Z 2025-05-07T19:46:02.7571663Z 2025-05-07T19:46:02.7571834Z  2025-05-07T19:46:02.7572031Z 2025-05-07T19:46:02.7572034Z 2025-05-07T19:46:02.7572038Z 2025-05-07T19:46:02.7572041Z 2025-05-07T19:46:02.7572045Z 2025-05-07T19:46:02.7572048Z 2025-05-07T19:46:02.7572051Z 2025-05-07T19:46:02.7572055Z 2025-05-07T19:46:02.7572058Z 2025-05-07T19:46:02.7572065Z 2025-05-07T19:46:02.7572068Z 2025-05-07T19:46:02.7572103Z 2025-05-07T19:46:02.7572256Z  2025-05-07T19:46:02.7572453Z 2025-05-07T19:46:02.7572457Z 2025-05-07T19:46:02.7572460Z 2025-05-07T19:46:02.7572464Z 2025-05-07T19:46:02.7572467Z 2025-05-07T19:46:02.7572470Z 2025-05-07T19:46:02.7572477Z 2025-05-07T19:46:02.7572480Z 2025-05-07T19:46:02.7572484Z 2025-05-07T19:46:02.7572523Z 2025-05-07T19:46:02.7572527Z 
2025-05-07T19:46:02.7572530Z 2025-05-07T19:46:02.7572534Z 2025-05-07T19:46:02.7572689Z  2025-05-07T19:46:02.7572902Z 2025-05-07T19:46:02.7572905Z 2025-05-07T19:46:02.7572909Z 2025-05-07T19:46:02.7572912Z 2025-05-07T19:46:02.7572915Z 2025-05-07T19:46:02.7572919Z 2025-05-07T19:46:02.7572922Z 2025-05-07T19:46:02.7572955Z 2025-05-07T19:46:02.7572959Z 2025-05-07T19:46:02.7572962Z 2025-05-07T19:46:02.7572966Z 2025-05-07T19:46:02.7572969Z 2025-05-07T19:46:02.7572973Z 2025-05-07T19:46:02.7572976Z 2025-05-07T19:46:02.7573196Z  2025-05-07T19:46:02.7573416Z 2025-05-07T19:46:02.7573419Z 2025-05-07T19:46:02.7573423Z 2025-05-07T19:46:02.7573426Z 2025-05-07T19:46:02.7573461Z 2025-05-07T19:46:02.7573464Z 2025-05-07T19:46:02.7573467Z 2025-05-07T19:46:02.7573471Z 2025-05-07T19:46:02.7573477Z 2025-05-07T19:46:02.7573481Z 2025-05-07T19:46:02.7573484Z 2025-05-07T19:46:02.7573488Z 2025-05-07T19:46:02.7573491Z 2025-05-07T19:46:02.7573495Z 2025-05-07T19:46:02.7573498Z 2025-05-07T19:46:02.7573664Z  2025-05-07T19:46:02.7573919Z 2025-05-07T19:46:02.7573922Z 2025-05-07T19:46:02.7573925Z 2025-05-07T19:46:02.7573929Z 2025-05-07T19:46:02.7573932Z 2025-05-07T19:46:02.7573935Z 2025-05-07T19:46:02.7573939Z 2025-05-07T19:46:02.7573942Z 2025-05-07T19:46:02.7573946Z 2025-05-07T19:46:02.7573949Z 2025-05-07T19:46:02.7573953Z 2025-05-07T19:46:02.7573956Z 2025-05-07T19:46:02.7573959Z 2025-05-07T19:46:02.7573963Z 2025-05-07T19:46:02.7573969Z 2025-05-07T19:46:02.7573973Z 2025-05-07T19:46:02.7574144Z  2025-05-07T19:46:02.7574406Z 2025-05-07T19:46:02.7574410Z 2025-05-07T19:46:02.7574413Z 2025-05-07T19:46:02.7574417Z 2025-05-07T19:46:02.7574420Z 2025-05-07T19:46:02.7574423Z 2025-05-07T19:46:02.7574476Z 2025-05-07T19:46:02.7574479Z 2025-05-07T19:46:02.7574483Z 2025-05-07T19:46:02.7574486Z 2025-05-07T19:46:02.7574489Z 2025-05-07T19:46:02.7574493Z 2025-05-07T19:46:02.7574496Z 2025-05-07T19:46:02.7574499Z 2025-05-07T19:46:02.7574503Z 2025-05-07T19:46:02.7574506Z 2025-05-07T19:46:02.7574509Z 
2025-05-07T19:46:02.7577745Z  done 2025-05-07T19:46:02.9688360Z Preparing transaction: / - done 2025-05-07T19:46:03.7719261Z Verifying 
transaction: | / - \ | / - \ done 2025-05-07T19:46:04.0779149Z Executing transaction: / - \ done 2025-05-07T19:46:06.0621002Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:06.0622458Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:06.0624643Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:06.0625746Z 2025-05-07T19:46:06.0640570Z 2025-05-07T19:46:06.0641757Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:06.0642697Z 2025-05-07T19:46:06.0662672Z 2025-05-07T19:46:06.0663297Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:06.0667947Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:06.0672355Z 2025-05-07T19:46:06.0878024Z 2025-05-07T19:46:06.0886318Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h 
/github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:06.0890559Z 2025-05-07T19:46:06.0898550Z 2025-05-07T19:46:06.0899385Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:06.1318132Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:07.9330981Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:46:08.0087115Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:08.0087766Z 2025-05-07T19:46:08.4186756Z 2025-05-07T19:46:08.4190744Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:08.4555077Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:08.4555704Z 2025-05-07T19:46:08.8656866Z 2025-05-07T19:46:08.8657864Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 
2025-05-07T19:46:08.8658990Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:08.8659781Z 2025-05-07T19:46:09.2827807Z 2025-05-07T19:46:11.2656369Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:13.2331041Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:15.1727985Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:15.1728978Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:17.0955138Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:18.8909361Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:18.8909741Z 2025-05-07T19:46:18.9494489Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:22.6055249Z /tmp/tmpo8jb1ds8: line 3: clang: command not found 2025-05-07T19:46:22.6055668Z 2025-05-07T19:46:22.6056188Z ERROR conda.cli.main_run:execute(125): `conda run clang --version` failed. (See above for error) 2025-05-07T19:46:22.6632678Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:22.6633894Z 2025-05-07T19:46:22.6656203Z total 56 2025-05-07T19:46:22.6656626Z drwxr-xr-x. 2 root root 16384 May 7 19:46 . 2025-05-07T19:46:22.6657259Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:22.6658318Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:22.6658852Z -rw-r--r--. 
2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:46:22.6659385Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:46:22.6660201Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:22.6660658Z -rw-r--r--. 2 root root 872 May 7 16:10 libxml2_activate.sh 2025-05-07T19:46:22.6661089Z -rw-r--r--. 2 root root 499 Mar 28 22:35 openjdk_activate.sh 2025-05-07T19:46:22.6661563Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:22.6661846Z 2025-05-07T19:46:22.6662095Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:22.6662913Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:22.6663358Z 2025-05-07T19:46:22.6679339Z 2025-05-07T19:46:22.6680738Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:22.6682117Z 2025-05-07T19:46:24.5416470Z 2025-05-07T19:46:24.5417635Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:24.5418246Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler" 2025-05-07T19:46:24.5418630Z 2025-05-07T19:46:24.9644091Z 2025-05-07T19:46:24.9645156Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:24.9645614Z 2025-05-07T19:46:26.7827498Z -allow-unsupported-compiler 2025-05-07T19:46:26.7827888Z 2025-05-07T19:46:26.8584443Z 2025-05-07T19:46:26.8585749Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:46:26.8587384Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:26.8588443Z 2025-05-07T19:46:28.7425208Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:28.7426362Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:28.7426801Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:28.7427181Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:28.7427574Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:28.7427863Z #define _STL_PAIR_H 1 2025-05-07T19:46:28.7428169Z #define __cpp_attributes 200809L 2025-05-07T19:46:28.7428520Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:28.7428929Z #define __DELETE_THROW throw() 2025-05-07T19:46:28.7429219Z #define _PTRDIFF_T_ 2025-05-07T19:46:28.7429520Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:28.7429908Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:46:28.7430757Z #define _IO_LEFT 02 2025-05-07T19:46:28.7431225Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:28.7431689Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:28.7432021Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:28.7432533Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:28.7433299Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:46:28.7433771Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:28.7434264Z #define _IOS_OUTPUT 2 2025-05-07T19:46:28.7434722Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:46:28.7435356Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:28.7456872Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:28.7457383Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:28.7458736Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ 
("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:28.7460043Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:28.7460428Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:46:28.7460764Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:28.7461143Z #define _T_WCHAR_ 2025-05-07T19:46:28.7461401Z #define stdout stdout 2025-05-07T19:46:28.7461805Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:28.7462800Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:28.7463296Z #define __flexarr [] 2025-05-07T19:46:28.7463821Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:28.7464191Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:28.7464619Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:28.7464910Z #define _MATH_H 1 2025-05-07T19:46:28.7465252Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:28.7465627Z #define __S64_TYPE long int 2025-05-07T19:46:28.7465943Z #define __stub_fchflags 2025-05-07T19:46:28.7466237Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:28.7466601Z #define __SQUAD_TYPE long int 2025-05-07T19:46:28.7466922Z #define __INTMAX_C(c) c ## L 2025-05-07T19:46:28.7467211Z #define _BSD_SIZE_T_DEFINED_ 2025-05-07T19:46:28.7467528Z #define NL_NMAX INT_MAX 2025-05-07T19:46:28.7467794Z #define _BITS_TIME_H 1 2025-05-07T19:46:28.7468125Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:28.7468489Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:28.7468857Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:28.7469249Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:28.7469715Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:28.7470123Z #define __CHAR_BIT__ 8 2025-05-07T19:46:28.7470446Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7471307Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:28.7471637Z #define 
_GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:28.7471963Z #define FP_NAN 0 2025-05-07T19:46:28.7472268Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:28.7472789Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:28.7473331Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:28.7473790Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:28.7474133Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:28.7474436Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:28.7474749Z #define __SM_80_RT_H__ 2025-05-07T19:46:28.7475004Z #define _NEW 2025-05-07T19:46:28.7475291Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:28.7475606Z #define __UINT8_MAX__ 0xff 2025-05-07T19:46:28.7476140Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:28.7476590Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:46:28.7476975Z #define __USE_ANSI 1 2025-05-07T19:46:28.7477293Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:28.7477758Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:28.7478321Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:28.7478663Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:46:28.7479015Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:28.7479333Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:46:28.7479681Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:28.7480006Z #define PIPE_BUF 4096 2025-05-07T19:46:28.7480393Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:28.7480795Z #define ADJ_TICK 0x4000 2025-05-07T19:46:28.7481136Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:28.7481496Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:28.7481822Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:28.7482236Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:28.7482750Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + 
__GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:28.7483358Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:28.7483766Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:28.7484089Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:28.7484395Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7484737Z #define __cpp_static_assert 201411L 2025-05-07T19:46:28.7485106Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:28.7485514Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:28.7485921Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:28.7486237Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:28.7486605Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:28.7486918Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:28.7487279Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7487670Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:28.7488085Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:46:28.7488507Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:28.7488883Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7489307Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:28.7489692Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:28.7490046Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:28.7490368Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:28.7490756Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:28.7491083Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:28.7491508Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:46:28.7491927Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:28.7492247Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:28.7492533Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:28.7492811Z #define __GCC_IEC_559 2 2025-05-07T19:46:28.7493112Z #define 
__cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:28.7493450Z #define _IO_flockfile(_fp) 2025-05-07T19:46:28.7493723Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:28.7493988Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:46:28.7494268Z #define _IOFBF 0 2025-05-07T19:46:28.7494479Z #define __USE_BSD 1 2025-05-07T19:46:28.7494718Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:46:28.7494987Z #define SHRT_MIN (-SHRT_MAX - 1) 2025-05-07T19:46:28.7495279Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:28.7495543Z #define _IO_NO_WRITES 8 2025-05-07T19:46:28.7495795Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:28.7496167Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:28.7496528Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:28.7496843Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:28.7497210Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:28.7497522Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:28.7497841Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:28.7498128Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:46:28.7498486Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:28.7498897Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:28.7499389Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:28.7499761Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:28.7500098Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:28.7500486Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:28.7500816Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:28.7501180Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:28.7501479Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:28.7501797Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:28.7502417Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? 
EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:28.7503079Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:28.7503449Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:28.7503798Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:28.7504146Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:28.7504441Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:28.7504751Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:28.7505079Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:28.7505552Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:28.7505858Z #define RAND_MAX 2147483647 2025-05-07T19:46:28.7506154Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:28.7506579Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7506900Z #define __SM_90_RT_H__ 2025-05-07T19:46:28.7507183Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:46:28.7507452Z #define __COMPAR_FN_T 2025-05-07T19:46:28.7507736Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:28.7508006Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:28.7508522Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:28.7509044Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:28.7509423Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:28.7509833Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:28.7510140Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:28.7510517Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:28.7510836Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:28.7511371Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:28.7511916Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:28.7512285Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:28.7512573Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:28.7512911Z #define 
__cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:28.7513259Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:28.7513536Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:46:28.7513838Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:28.7514115Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:28.7514397Z #define __u_char_defined 2025-05-07T19:46:28.7514720Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:28.7515117Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:28.7515383Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:28.7515668Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:46:28.7516036Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:28.7516700Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:28.7517217Z #define FP_INFINITE 1 2025-05-07T19:46:28.7517615Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:28.7518109Z #define _IO_pid_t __pid_t 2025-05-07T19:46:28.7518388Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:46:28.7518698Z #define __LEAF , __leaf__ 2025-05-07T19:46:28.7518966Z #define PATH_MAX 4096 2025-05-07T19:46:28.7519272Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:46:28.7519634Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:28.7520026Z #define _LIMITS_H___ 2025-05-07T19:46:28.7520310Z #define __size_t 2025-05-07T19:46:28.7520643Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:28.7521262Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:28.7521890Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:28.7522260Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:28.7522631Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:46:28.7522966Z #define _WCHAR_T_DEFINED 2025-05-07T19:46:28.7523356Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:28.7523816Z #define 
MOD_STATUS ADJ_STATUS 2025-05-07T19:46:28.7524163Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:28.7524525Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:28.7524866Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:28.7525171Z #define __INT8_C(c) c 2025-05-07T19:46:28.7525477Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:28.7525802Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:28.7526120Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:28.7526409Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:28.7526717Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:28.7527014Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:28.7527400Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7527787Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:46:28.7528181Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:28.7528626Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:28.7528904Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:28.7529258Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:28.7529573Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:28.7529987Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:28.7530384Z #define NFDBITS __NFDBITS 2025-05-07T19:46:28.7530687Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:28.7530984Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:28.7531350Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:28.7531708Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:46:28.7531978Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:28.7532306Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:28.7532620Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:28.7532971Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:28.7533399Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:28.7533794Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:46:28.7534085Z #define 
__cpp_if_constexpr 201606L 2025-05-07T19:46:28.7534433Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:28.7534836Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:28.7535178Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:28.7535530Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:28.7535870Z #define __daddr_t_defined 2025-05-07T19:46:28.7536152Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:46:28.7536435Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:28.7536790Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:28.7537303Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:28.7537831Z #define _ACRTIMP 2025-05-07T19:46:28.7538091Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:28.7538365Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:28.7538698Z #define _IOS_BIN 128 2025-05-07T19:46:28.7539057Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:28.7539507Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7539785Z #define UNDERFLOW 4 2025-05-07T19:46:28.7540041Z #define NAME_MAX 255 2025-05-07T19:46:28.7540290Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:28.7540593Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:46:28.7540905Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:28.7541204Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:28.7543336Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:46:28.7543780Z #define __ptr_t void * 2025-05-07T19:46:28.7544051Z #define M_E 2.7182818284590452354 2025-05-07T19:46:28.7544335Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:28.7544632Z #define __USE_ISOCXX11 1 2025-05-07T19:46:28.7544913Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:28.7545277Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:28.7545577Z #define 
CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:28.7545896Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:28.7546218Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:28.7546695Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:28.7547174Z #define __linux 1 2025-05-07T19:46:28.7547436Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:46:28.7547846Z #define cudaDeviceMask 0xff 2025-05-07T19:46:28.7548145Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:28.7548499Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:28.7548810Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:28.7549164Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:28.7549509Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:46:28.7549879Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:28.7550243Z #define _BITS_TYPES_H 1 2025-05-07T19:46:28.7550561Z #define ULONG_LONG_MAX (LONG_LONG_MAX * 2ULL + 1ULL) 2025-05-07T19:46:28.7551102Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:28.7551444Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:28.7551793Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:28.7552117Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:28.7552476Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:28.7553337Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:28.7554255Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:28.7554592Z #define __unix 1 2025-05-07T19:46:28.7554826Z #define MATH_ERRNO 1 2025-05-07T19:46:28.7555088Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:28.7555371Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:28.7555661Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:46:28.7556016Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:28.7556323Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:28.7556629Z #define 
_GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:28.7557108Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:28.7557604Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:28.7557904Z #define CUDARTAPI_CDECL 2025-05-07T19:46:28.7558178Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:28.7558457Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:28.7558756Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:28.7559016Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:28.7559267Z #define __SIZE_T 2025-05-07T19:46:28.7559520Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:28.7559863Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:28.7560176Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:28.7560434Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:28.7560713Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:28.7561114Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:28.7561579Z #define __WAIT_STATUS void * 2025-05-07T19:46:28.7561843Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:28.7562123Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:28.7562406Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:46:28.7562751Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:28.7563087Z #define __WINT_MIN__ 0U 2025-05-07T19:46:28.7563718Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:28.7564452Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:28.7564890Z #define WUNTRACED 2 2025-05-07T19:46:28.7565193Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:28.7565504Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:28.7565868Z #define NZERO 20 2025-05-07T19:46:28.7566132Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:28.7566482Z #define _PSTL_PRAGMA(x) _Pragma(#x) 2025-05-07T19:46:28.7566846Z 
#define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:28.7567163Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:28.7567456Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:46:28.7567765Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:46:28.7568188Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:28.7568469Z #define SCHAR_MIN (-SCHAR_MAX - 1) 2025-05-07T19:46:28.7568777Z #define EXIT_FAILURE 1 2025-05-07T19:46:28.7569027Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:28.7569317Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:28.7569588Z #define _SIZE_T_DEFINED_ 2025-05-07T19:46:28.7569876Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:28.7570192Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:28.7570535Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:28.7570923Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:28.7571220Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:46:28.7571506Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:46:28.7571784Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:28.7572194Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:28.7572498Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:46:28.7572792Z #define SEEK_DATA 3 2025-05-07T19:46:28.7573007Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:28.7573298Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:28.7573711Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:28.7574085Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:28.7574330Z #define __INT64_C(c) c ## L 2025-05-07T19:46:28.7574582Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:28.7574915Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:28.7575223Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:28.7575492Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:28.7575769Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:28.7576065Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:28.7576306Z 
#define __INT_WCHAR_T_H 2025-05-07T19:46:28.7576548Z #define WSTOPPED 2 2025-05-07T19:46:28.7576786Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:28.7577057Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:28.7577307Z #define FP_NORMAL 4 2025-05-07T19:46:28.7577533Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:28.7577815Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:28.7578041Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:28.7578291Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:28.7578554Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:28.7578821Z #define cudaTextureType1D 0x01 2025-05-07T19:46:28.7579074Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:28.7579338Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:28.7579612Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:28.7579893Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:28.7580317Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:28.7580751Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:28.7581024Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:28.7581276Z #define _POSIX_SOURCE 1 2025-05-07T19:46:28.7581520Z #define cudaTextureType2D 0x02 2025-05-07T19:46:28.7581794Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:28.7582101Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:28.7582460Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:28.7582735Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:28.7583094Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:28.7583445Z #define cudaTextureType3D 0x03 2025-05-07T19:46:28.7583756Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:28.7584032Z #define CLOCK_REALTIME 0 2025-05-07T19:46:28.7584396Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:46:28.7584687Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:28.7585042Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:28.7585372Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:28.7585672Z 
#define cudaEventBlockingSync 0x01 2025-05-07T19:46:28.7586006Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:28.7586303Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:28.7586663Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:28.7586969Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:28.7587301Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:46:28.7587565Z #define __GLIBC__ 2 2025-05-07T19:46:28.7587845Z #define __END_DECLS } 2025-05-07T19:46:28.7588111Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:28.7588524Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:46:28.7588953Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:28.7589230Z #define WCONTINUED 8 2025-05-07T19:46:28.7589519Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:28.7589787Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:28.7590096Z #define _ALLOCA_H 1 2025-05-07T19:46:28.7590341Z #define __host__ __location__(host) 2025-05-07T19:46:28.7590803Z #define __warndecl(name,msg) extern void name (void) __attribute__((__warning__ (msg))) 2025-05-07T19:46:28.7591244Z #define __SLONG32_TYPE int 2025-05-07T19:46:28.7591613Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:28.7591904Z #define _SYS_SELECT_H 1 2025-05-07T19:46:28.7592177Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:28.7592463Z #define _IOS_NOCREATE 32 2025-05-07T19:46:28.7592722Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:46:28.7593035Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:28.7593338Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:28.7593657Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:28.7593952Z #define __global__ __location__(global) 2025-05-07T19:46:28.7594275Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:28.7594550Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:28.7594870Z #define __DBL_DIG__ 15 2025-05-07T19:46:28.7595113Z #define TIME_UTC 1 2025-05-07T19:46:28.7595368Z #define __FLT32_DIG__ 6 2025-05-07T19:46:28.7595731Z 
#define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:28.7596215Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:28.7596762Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:28.7597203Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:28.7597571Z #define _G_BUFSIZ 8192 2025-05-07T19:46:28.7597914Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:46:28.7598364Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:28.7598692Z #define __cudaCDP2GetDevice 2025-05-07T19:46:28.7599029Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:28.7599377Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:28.7599655Z #define __GXX_WEAK__ 1 2025-05-07T19:46:28.7599965Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:28.7600300Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:28.7600617Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:28.7600941Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:28.7601335Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:28.7601639Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:28.7601977Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:28.7602312Z #define _G_config_h 1 2025-05-07T19:46:28.7602646Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:28.7603041Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:28.7603346Z #define _GCC_WCHAR_T 2025-05-07T19:46:28.7603624Z #define TMP_MAX 238328 2025-05-07T19:46:28.7603885Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:46:28.7604216Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:28.7604502Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:28.7604834Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:28.7605136Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:28.7605485Z #define _IO_SKIPWS 01 2025-05-07T19:46:28.7606026Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:28.7606550Z #define 
_IO_SCIENTIFIC 04000 2025-05-07T19:46:28.7606882Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:28.7607239Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:46:28.7607670Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:28.7608064Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:28.7608596Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:46:28.7608860Z #define le32toh(x) (x) 2025-05-07T19:46:28.7609131Z #define _SIZE_T_DEFINED 2025-05-07T19:46:28.7609400Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:28.7609772Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:28.7610150Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:46:28.7610557Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:28.7611010Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:28.7611291Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:28.7611589Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:28.7611871Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:28.7612188Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:28.7612736Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:28.7613348Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:28.7613709Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:28.7614064Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:28.7614422Z #define _WCHAR_T_ 2025-05-07T19:46:28.7614670Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:28.7615074Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:46:28.7615452Z #define RTSIG_MAX 32 2025-05-07T19:46:28.7615661Z #define _STDDEF_H 2025-05-07T19:46:28.7615862Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:28.7616137Z #define _VA_LIST_DEFINED 2025-05-07T19:46:28.7616398Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:46:28.7616749Z 
#define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:28.7617152Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:28.7617481Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:46:28.7617799Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:28.7618262Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:28.7618806Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:28.7619167Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:28.7619507Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:28.7619836Z #define __unix__ 1 2025-05-07T19:46:28.7620064Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:28.7620341Z #define __INT_WIDTH__ 32 2025-05-07T19:46:28.7620566Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:28.7620802Z #define _IONBF 2 2025-05-07T19:46:28.7621234Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:28.7622022Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:28.7622541Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:28.7622827Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:28.7623105Z #define __UINT16_C(c) c 2025-05-07T19:46:28.7623324Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:28.7623597Z #define STA_DEL 0x0020 2025-05-07T19:46:28.7623820Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:28.7624067Z #define __id_t_defined 2025-05-07T19:46:28.7624327Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:28.7624768Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:28.7625176Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:28.7625443Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:28.7625751Z #define __DECIMAL_DIG__ 21 2025-05-07T19:46:28.7625996Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:28.7626251Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:28.7626520Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:28.7626831Z #define SING 2 2025-05-07T19:46:28.7627058Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:28.7627370Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7627679Z #define cudaStreamDefault 0x00 2025-05-07T19:46:28.7628064Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:46:28.7628448Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:28.7628756Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:28.7629074Z #define __gnu_linux__ 1 2025-05-07T19:46:28.7629326Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:46:28.7629625Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:28.7629888Z #define MAX_INPUT 255 2025-05-07T19:46:28.7630167Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:46:28.7630506Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:28.7630924Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:28.7631255Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:28.7631556Z 
#define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:28.7631965Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:28.7632499Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:28.7632860Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:28.7633234Z #define _Mfloat_ float 2025-05-07T19:46:28.7633554Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:28.7633886Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:46:28.7634215Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:28.7634705Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:28.7635233Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7635516Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:28.7635950Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:28.7636358Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:28.7636855Z #define __USE_ISOC11 1 2025-05-07T19:46:28.7637213Z #define _BSD_SIZE_T_ 2025-05-07T19:46:28.7637475Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:28.7637783Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:28.7638085Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:28.7638450Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:28.7638810Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:46:28.7639176Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:28.7639538Z #define __THROW throw () 2025-05-07T19:46:28.7639842Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:28.7640193Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7640582Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:28.7640990Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:28.7641298Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:28.7641617Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:46:28.7641907Z #define __GLIBC_HAVE_LONG_LONG 1 
2025-05-07T19:46:28.7642219Z #define L_tmpnam 20 2025-05-07T19:46:28.7642473Z #define ___int_wchar_t_h 2025-05-07T19:46:28.7642877Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:28.7643324Z #define isascii(c) __isascii (c) 2025-05-07T19:46:28.7643608Z #define _T_PTRDIFF 2025-05-07T19:46:28.7643956Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:28.7644346Z #define toascii(c) __toascii (c) 2025-05-07T19:46:28.7644657Z #define __GNUC__ 11 2025-05-07T19:46:28.7644932Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:28.7645291Z #define __GXX_RTTI 1 2025-05-07T19:46:28.7645540Z #define __pie__ 2 2025-05-07T19:46:28.7645816Z #define __MMX__ 1 2025-05-07T19:46:28.7646066Z #define __cudaCDP2Malloc 2025-05-07T19:46:28.7646385Z #define __timespec_defined 1 2025-05-07T19:46:28.7646900Z #define L_ctermid 9 2025-05-07T19:46:28.7647284Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:28.7647655Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:28.7648075Z #define offsetof(TYPE,MEMBER) __builtin_offsetof (TYPE, MEMBER) 2025-05-07T19:46:28.7648528Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:28.7648827Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:28.7649183Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:28.7649521Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:28.7649899Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:28.7650205Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:28.7650712Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:28.7651565Z #define assert_perror(errnum) (!(errnum) ? 
__ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:28.7652238Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:28.7652602Z #define __USE_SVID 1 2025-05-07T19:46:28.7652890Z #define __constant__ __location__(constant) 2025-05-07T19:46:28.7653267Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:28.7653594Z #define __device__ __location__(device) 2025-05-07T19:46:28.7653988Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:28.7654386Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:28.7654770Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:28.7655126Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:28.7655515Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:28.7655963Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:28.7656285Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:28.7656721Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:28.7657143Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:28.7657460Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:28.7657869Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:28.7658382Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:28.7658773Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:28.7659086Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:46:28.7659532Z #define NGROUPS_MAX 65536 2025-05-07T19:46:28.7659817Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:28.7660149Z #define __USE_ISOC95 1 2025-05-07T19:46:28.7660410Z #define _TIME_H 1 2025-05-07T19:46:28.7660731Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:28.7661074Z #define __USE_ISOC99 1 2025-05-07T19:46:28.7661443Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:28.7661869Z #define HOST_NAME_MAX 64 2025-05-07T19:46:28.7662135Z #define _POSIX_SEM_NSEMS_MAX 256 
2025-05-07T19:46:28.7662436Z #define _IOS_ATEND 4 2025-05-07T19:46:28.7662688Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:28.7663060Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:28.7663491Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:28.7663887Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:28.7664194Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:28.7664562Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:28.7664900Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:46:28.7665214Z #define _STDIO_H 1 2025-05-07T19:46:28.7665678Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:28.7666188Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:28.7666606Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:28.7667011Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:28.7667353Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:28.7667645Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:28.7667967Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:28.7668288Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:28.7668654Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7669087Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:28.7669392Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:28.7669734Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:28.7670073Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:28.7670407Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:28.7670725Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:28.7671149Z #define _ISbit(bit) ((bit) < 8 ? 
((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:28.7671546Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:28.7671841Z #define __USE_XOPEN 1 2025-05-07T19:46:28.7672139Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:28.7672600Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:28.7673100Z #define __USE_XOPEN2K 1 2025-05-07T19:46:28.7673363Z #define _PSTL_UDR_PRESENT 1 2025-05-07T19:46:28.7673675Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:46:28.7673989Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:28.7674309Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:28.7674855Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:28.7675432Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:28.7675765Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:46:28.7676216Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:28.7676970Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:28.7677375Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:28.7677831Z #define __END_NAMESPACE_C99 2025-05-07T19:46:28.7678131Z #define __glibcxx_integral_traps true 2025-05-07T19:46:28.7678474Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:28.7678757Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:28.7679066Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:46:28.7679361Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:28.7679669Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:28.7680019Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:46:28.7680351Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:28.7680773Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:28.7681194Z #define LONG_MIN (-LONG_MAX - 1L) 2025-05-07T19:46:28.7681531Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:28.7681821Z #define _IO_UNITBUF 020000 2025-05-07T19:46:28.7682127Z #define 
_GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:28.7682424Z #define __FD_SETSIZE 1024 2025-05-07T19:46:28.7682731Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:28.7683040Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:28.7683441Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:28.7683868Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:46:28.7684172Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:46:28.7684556Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:28.7684914Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:28.7685243Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:46:28.7685578Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:28.7685986Z #define _WCHAR_T_DEFINED_ 2025-05-07T19:46:28.7686300Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:28.7686697Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:28.7687042Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:28.7687342Z #define __USE_POSIX199506 1 2025-05-07T19:46:28.7687655Z #define _FEATURES_H 1 2025-05-07T19:46:28.7687924Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:28.7688387Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:28.7688953Z #define __stub_getmsg 2025-05-07T19:46:28.7689242Z #define _IO_FIXED 010000 2025-05-07T19:46:28.7689546Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:28.7689918Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:28.7690221Z #define __stub_setlogin 2025-05-07T19:46:28.7690514Z #define __stub_fattach 2025-05-07T19:46:28.7690812Z #define __cplusplus 201703L 2025-05-07T19:46:28.7691109Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:28.7691540Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:28.7691828Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:28.7692168Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:28.7692680Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 
2025-05-07T19:46:28.7693269Z #define _IO_INTERNAL 010 2025-05-07T19:46:28.7693550Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:46:28.7693948Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:28.7694377Z #define __dev_t_defined 2025-05-07T19:46:28.7694648Z #define __DEPRECATED 1 2025-05-07T19:46:28.7694943Z #define __S32_TYPE int 2025-05-07T19:46:28.7695223Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:28.7695589Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:28.7695880Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:28.7696203Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:28.7696837Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:28.7697525Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:28.7697862Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:28.7698264Z #define OVERFLOW 3 2025-05-07T19:46:28.7698550Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:28.7698884Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:46:28.7699285Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:28.7699645Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:28.7700029Z #define __SSE2_MATH__ 1 2025-05-07T19:46:28.7700295Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:46:28.7700655Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:28.7700982Z #define _IO_STDIO_H 2025-05-07T19:46:28.7701277Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:28.7701622Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:28.7701966Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:28.7702325Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7702675Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:28.7702996Z #define __amd64 1 2025-05-07T19:46:28.7703244Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:28.7703562Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:28.7703865Z 
#define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:28.7704203Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:28.7704539Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:28.7704858Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:28.7705205Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:28.7705493Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:28.7705792Z #define __bounded 2025-05-07T19:46:28.7706049Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:28.7706387Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:28.7706689Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:28.7707010Z #define _PTRDIFF_T_DECLARED 2025-05-07T19:46:28.7707304Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7707675Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:28.7708111Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:28.7708553Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:28.7708873Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:28.7709234Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:28.7709633Z #define STA_PLL 0x0001 2025-05-07T19:46:28.7709894Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:46:28.7710215Z #define __GNUG__ 11 2025-05-07T19:46:28.7710467Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:28.7710777Z #define _T_WCHAR 2025-05-07T19:46:28.7711032Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:28.7711364Z #define __specialization_static 2025-05-07T19:46:28.7711686Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:46:28.7712048Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:28.7712355Z #define cudaArraySparse 0x40 2025-05-07T19:46:28.7712647Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:28.7712943Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:28.7713308Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:28.7713672Z #define _WCHAR_T 2025-05-07T19:46:28.7713913Z #define __cudaCDP2Free 2025-05-07T19:46:28.7714627Z 
#define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:28.7715391Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:28.7715877Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:28.7716559Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:46:28.7716873Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:28.7717241Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:28.7717603Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:28.7718019Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:28.7718291Z #define __NO_CTYPE 1 2025-05-07T19:46:28.7718583Z #define __stub_bdflush 2025-05-07T19:46:28.7718982Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:28.7719479Z #define __CORRECT_ISO_CPP_STRING_H_PROTO 2025-05-07T19:46:28.7719852Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:28.7720154Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:46:28.7720557Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:28.7720893Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:28.7721245Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:28.7721610Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:28.7722022Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:28.7722339Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:46:28.7722684Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:28.7723085Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:28.7723467Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:28.7723791Z #define _IO_STDIO 040000 2025-05-07T19:46:28.7724123Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:28.7724534Z #define cudaSurfaceType1DLayered 0xF1 
2025-05-07T19:46:28.7724856Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:28.7725168Z #define _PTRDIFF_T 2025-05-07T19:46:28.7725392Z #define _MOVE_H 1 2025-05-07T19:46:28.7725639Z #define __cpp_hex_float 201603L 2025-05-07T19:46:28.7725908Z #define ADJ_TAI 0x0080 2025-05-07T19:46:28.7726157Z #define __ptrvalue 2025-05-07T19:46:28.7726402Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:28.7726655Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:46:28.7726963Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:28.7727275Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:28.7727550Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:28.7727837Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:28.7728259Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:28.7728766Z #define __USE_GNU 1 2025-05-07T19:46:28.7729017Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:46:28.7729309Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:28.7729572Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:28.7729977Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:28.7730370Z #define WEXITED 4 2025-05-07T19:46:28.7730602Z #define _IO_NO_READS 4 2025-05-07T19:46:28.7730894Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:28.7731260Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:28.7731537Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:28.7731844Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:28.7732155Z #define __uid_t_defined 2025-05-07T19:46:28.7732409Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:28.7732706Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:28.7732974Z #define WNOHANG 1 2025-05-07T19:46:28.7733227Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:28.7733531Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:28.7733897Z #define cudaEventDefault 0x00 2025-05-07T19:46:28.7734196Z 
#define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:28.7734531Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:28.7734761Z #define __x86_64 1 2025-05-07T19:46:28.7735001Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:28.7735396Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:28.7735897Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:28.7736421Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:28.7736863Z #define __PTRDIFF_T 2025-05-07T19:46:28.7737197Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:28.7737580Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:28.7737865Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:28.7738153Z #define _Mlong_double_ long double 2025-05-07T19:46:28.7738441Z #define __cpp_lambdas 200907L 2025-05-07T19:46:28.7738710Z #define _IO_DEC 020 2025-05-07T19:46:28.7738931Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:28.7739208Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:28.7739492Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:28.7739785Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:28.7740042Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:28.7740409Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:28.7740726Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:28.7741009Z #define _ANSI_STDDEF_H 2025-05-07T19:46:28.7741275Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:28.7741607Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:28.7741988Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:46:28.7742375Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:28.7742675Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:28.7742983Z #define __cpp_template_auto 201606L 2025-05-07T19:46:28.7743372Z #define __DBL_MIN__ 
double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:46:28.7743745Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:28.7744062Z #define __key_t_defined 2025-05-07T19:46:28.7744321Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:28.7744704Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:28.7745225Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:46:28.7745596Z #define __GNUC_VA_LIST 2025-05-07T19:46:28.7745944Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:28.7746328Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:28.7746738Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:28.7747189Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:28.7747638Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:28.7747980Z #define __WCOREFLAG 0x80 2025-05-07T19:46:28.7748260Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:28.7748569Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:28.7748906Z #define __LP64__ 1 2025-05-07T19:46:28.7749205Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:28.7749551Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:28.7749888Z #define _IO_off64_t __off64_t 2025-05-07T19:46:28.7750182Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7750499Z #define __time_t_defined 1 2025-05-07T19:46:28.7750774Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:28.7751186Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:46:28.7751592Z #define __USE_UNIX98 1 2025-05-07T19:46:28.7751895Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:28.7752225Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:28.7752528Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:28.7752896Z #define __LEAF_ATTR __attribute__ ((__leaf__)) 2025-05-07T19:46:28.7753246Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:46:28.7753570Z #define SEEK_CUR 1 2025-05-07T19:46:28.7753831Z 
#define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:28.7754264Z #define _ASSERT_H 1 2025-05-07T19:46:28.7754895Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:28.7755611Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:28.7756024Z #define CHAR_MAX SCHAR_MAX 2025-05-07T19:46:28.7756321Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:28.7756650Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:28.7756963Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:28.7757415Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:28.7757872Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:28.7758642Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:28.7759369Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:28.7759737Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:28.7760160Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:28.7760581Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:28.7760923Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:46:28.7761247Z #define cudaArrayDefault 0x00 2025-05-07T19:46:28.7761597Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:28.7762032Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:28.7762381Z #define TLOSS 5 2025-05-07T19:46:28.7762634Z #define __ssize_t_defined 2025-05-07T19:46:28.7762959Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:28.7763302Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:28.7763636Z #define ULONG_MAX (LONG_MAX * 2UL + 1UL) 2025-05-07T19:46:28.7764003Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:46:28.7764401Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:28.7764868Z #define _POSIX_HIWAT 
_POSIX_PIPE_BUF 2025-05-07T19:46:28.7765194Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:46:28.7765562Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:28.7765912Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:28.7766275Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:28.7766604Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:28.7766936Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:46:28.7767333Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:28.7767734Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:28.7768120Z #define __cdecl 2025-05-07T19:46:28.7768365Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:28.7768724Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:28.7769066Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:28.7769364Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:28.7769639Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:28.7769962Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:28.7770238Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:28.7770587Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:28.7770957Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:28.7771376Z #define __NV_GLIBCXX_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:28.7771850Z #define ADJ_NANO 0x2000 2025-05-07T19:46:28.7772175Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:46:28.7772567Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:28.7792898Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:28.7793200Z #define __FLT_DIG__ 6 2025-05-07T19:46:28.7793547Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:28.7793950Z #define __NO_INLINE__ 1 2025-05-07T19:46:28.7794241Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:28.7794558Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:28.7794807Z #define ADJ_STATUS 0x0010 
2025-05-07T19:46:28.7795078Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:28.7795392Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:28.7795803Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:28.7796246Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:28.7796721Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:46:28.7797150Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:28.7797586Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:28.7797949Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:46:28.7798320Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:28.7798551Z #define MAX_CANON 255 2025-05-07T19:46:28.7798785Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:28.7799040Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:28.7799347Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:28.7799658Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:28.7800010Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:28.7800367Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:28.7800677Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:28.7801060Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:28.7801398Z #define __VERSION__ "11.4.0" 2025-05-07T19:46:28.7801698Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:28.7802016Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:28.7802340Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:28.7802645Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:28.7803075Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:28.7803398Z #define __UINT64_C(c) c ## UL 2025-05-07T19:46:28.7803703Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:28.7804015Z #define _SYS_TYPES_H 1 2025-05-07T19:46:28.7804281Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:28.7804594Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:28.7804875Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:28.7805146Z #define _GLIBCXX_HAVE_TANHL 1 
2025-05-07T19:46:28.7805443Z #define __cpp_unicode_characters 201411L 2025-05-07T19:46:28.7805784Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:28.7806067Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:28.7806368Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:28.7806662Z #define FP_SUBNORMAL 3 2025-05-07T19:46:28.7806967Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:28.7807291Z #define _INITIALIZER_LIST 2025-05-07T19:46:28.7807570Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:28.7807864Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:28.7808174Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:28.7808616Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:28.7808875Z #define _IO_file_flags _flags 2025-05-07T19:46:28.7809155Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:28.7809419Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:28.7809724Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:28.7810007Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:28.7810314Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:28.7810702Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:28.7811103Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:28.7811436Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:28.7811704Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:28.7811990Z #define _BSD_SOURCE 1 2025-05-07T19:46:28.7812227Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:28.7813105Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_ ##_NTYPE : false_type { }; template<typename _Tp> struct __has_ ##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:46:28.7813981Z #define __catch(X) catch(X) 2025-05-07T19:46:28.7814254Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:46:28.7814585Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:28.7814875Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:28.7815152Z #define __STRING(x) #x 2025-05-07T19:46:28.7815411Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 
2025-05-07T19:46:28.7815726Z #define _T_PTRDIFF_ 2025-05-07T19:46:28.7815973Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:28.7816305Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:28.7816640Z #define __unbounded 2025-05-07T19:46:28.7816921Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:28.7817230Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:46:28.7817523Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:28.7817848Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:28.7818137Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:28.7818460Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:28.7818790Z #define LONG_LONG_MIN (-LONG_LONG_MAX - 1LL) 2025-05-07T19:46:28.7819129Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:28.7819407Z #define __managed__ __location__(managed) 2025-05-07T19:46:28.7819734Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:28.7820138Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:28.7820596Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:28.7820879Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:28.7821262Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:28.7821695Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:46:28.7821957Z #define _SYS_SIZE_T_H 2025-05-07T19:46:28.7822290Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:28.7822633Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:28.7822945Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:28.7824044Z #define _CRTIMP 2025-05-07T19:46:28.7824305Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:28.7824634Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:28.7824946Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:28.7825293Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:28.7825681Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 
2025-05-07T19:46:28.7825998Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:28.7826256Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:28.7826542Z #define __SIZE_T__ 2025-05-07T19:46:28.7826745Z #define __stub_gtty 2025-05-07T19:46:28.7826978Z #define __pid_t_defined 2025-05-07T19:46:28.7827230Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:28.7827513Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:28.7827822Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:28.7828100Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:28.7828345Z #define __need_clockid_t 2025-05-07T19:46:28.7828569Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:28.7828823Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:28.7829122Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:28.7829435Z #define _IO_HEX 0100 2025-05-07T19:46:28.7829676Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:28.7830007Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:28.7830470Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:28.7830785Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:28.7831219Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:28.7831709Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:28.7832041Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:28.7832361Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:28.7832671Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:28.7832843Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:28.7832939Z #define __stub_sstk 2025-05-07T19:46:28.7833046Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:28.7833214Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:28.7833327Z #define __wur 2025-05-07T19:46:28.7833458Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:28.7833561Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:28.7833672Z #define _IO_OCT 040 2025-05-07T19:46:28.7833782Z 
#define __FLT128_HAS_DENORM__ 1 2025-05-07T19:46:28.7833886Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:28.7833980Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:28.7834131Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:28.7834231Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:28.7834410Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:28.7834628Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:28.7834730Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:46:28.7834830Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:28.7834943Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:28.7835064Z #define __off64_t_defined 2025-05-07T19:46:28.7835176Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:28.7835278Z #define __FLT128_DIG__ 33 2025-05-07T19:46:28.7835423Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:28.7835533Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:28.7835634Z #define __INT32_C(c) c 2025-05-07T19:46:28.7835749Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:46:28.7835954Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:28.7836072Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:46:28.7836179Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:28.7836475Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:28.7836586Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:28.7836738Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:28.7836878Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:28.7836980Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:28.7837088Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:46:28.7837221Z #define __have_pthread_attr_t 1 2025-05-07T19:46:28.7837428Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:28.7837668Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:28.7837783Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:28.7837908Z #define __cudaCDP2EventRecord 2025-05-07T19:46:28.7838010Z #define 
_BITS_TYPESIZES_H 1 2025-05-07T19:46:28.7838118Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:46:28.7838217Z #define htole32(x) (x) 2025-05-07T19:46:28.7838488Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:28.7838625Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:28.7838738Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:28.7838932Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:28.7839082Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:28.7839216Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:46:28.7839369Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:28.7839499Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:28.7839610Z #define cudaArrayLayered 0x01 2025-05-07T19:46:28.7839790Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:28.7839938Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:28.7840042Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:28.7840148Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:28.7840264Z #define unix 1 2025-05-07T19:46:28.7840362Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:28.7840463Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:28.7840565Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:28.7840716Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:28.7840816Z #define __USE_POSIX 1 2025-05-07T19:46:28.7840925Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:28.7841093Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:28.7841199Z #define __THROWNL throw () 2025-05-07T19:46:28.7841295Z #define __cpp_rtti 199711L 2025-05-07T19:46:28.7841416Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:28.7841544Z #define __PMT(args) args 2025-05-07T19:46:28.7841667Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7841823Z #define __va_arg_pack_len() __builtin_va_arg_pack_len () 2025-05-07T19:46:28.7841969Z 
#define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:28.7842071Z #define _SIZE_T_DECLARED 2025-05-07T19:46:28.7842171Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:28.7842269Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:46:28.7842716Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:28.7842883Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:28.7842984Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:28.7843109Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:28.7843260Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:46:28.7843351Z #define _WCHAR_T_H 2025-05-07T19:46:28.7843448Z #define __FLT64X_DIG__ 18 2025-05-07T19:46:28.7843571Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:28.7843669Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:28.7843777Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:28.7843900Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:28.7844001Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:28.7844118Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:28.7844211Z #define __ELF__ 1 2025-05-07T19:46:28.7844347Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:28.7844459Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:28.7844560Z #define STA_INS 0x0010 2025-05-07T19:46:28.7844687Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:28.7844873Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:28.7844978Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:28.7845086Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:28.7845229Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7845352Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7845532Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:28.7845674Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:28.7845783Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:28.7845953Z #define __warnattr(msg) 
__attribute__((__warning__ (msg))) 2025-05-07T19:46:28.7846147Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:28.7846259Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:28.7846772Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:28.7846919Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:28.7847039Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:28.7847141Z #define __FLT_RADIX__ 2 2025-05-07T19:46:28.7847258Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:46:28.7847454Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:46:28.7847565Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:46:28.7847674Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:28.7847793Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:28.7847924Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:28.7848033Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:28.7848152Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:28.7848261Z #define WORD_BIT 32 2025-05-07T19:46:28.7848361Z #define _IO_USER_BUF 1 2025-05-07T19:46:28.7848468Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:28.7848587Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7848727Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:28.7848838Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:28.7848951Z #define __long_double_t long double 2025-05-07T19:46:28.7849079Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:28.7849178Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:28.7849605Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:28.7849729Z #define __k8 1 2025-05-07T19:46:28.7849943Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:28.7850134Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:46:28.7850267Z #define __LDBL_REDIR(name,proto) 
name proto 2025-05-07T19:46:28.7850401Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:46:28.7850506Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:28.7850625Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:28.7850761Z #define __blksize_t_defined 2025-05-07T19:46:28.7850869Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:28.7850979Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:28.7851106Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:28.7851341Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:28.7851461Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:28.7851569Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:28.7851704Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:28.7851982Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:28.7852347Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:28.7852461Z #define UCHAR_MAX (SCHAR_MAX * 2 + 1) 2025-05-07T19:46:28.7852599Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:28.7852694Z #define SEEK_SET 0 2025-05-07T19:46:28.7852808Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:28.7852952Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:28.7853155Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:28.7853267Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:28.7853406Z #define __cudaCDP2GetLastError 2025-05-07T19:46:28.7853517Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:28.7853617Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:28.7853961Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:28.7854103Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:28.7854294Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:28.7854397Z #define __stub_sigreturn 2025-05-07T19:46:28.7854683Z #define __errordecl(name,msg) extern 
void name (void) __attribute__((__error__ (msg))) 2025-05-07T19:46:28.7854790Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:28.7854889Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:28.7855002Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:28.7855138Z #define CLOCK_TAI 11 2025-05-07T19:46:28.7855255Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:28.7855347Z #define __restrict_arr 2025-05-07T19:46:28.7855504Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:28.7855664Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:28.7856237Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:28.7856467Z #define __attribute_artificial__ __attribute__ ((__artificial__)) 2025-05-07T19:46:28.7856573Z #define __USE_MISC 1 2025-05-07T19:46:28.7856688Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:28.7856800Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:28.7856917Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:28.7857022Z #define __LDBL_DIG__ 18 2025-05-07T19:46:28.7857130Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:28.7857260Z #define __malloc_and_calloc_defined 2025-05-07T19:46:28.7857362Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:46:28.7857479Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:28.7857574Z #define __x86_64__ 1 2025-05-07T19:46:28.7857688Z #define _SIZE_T_ 2025-05-07T19:46:28.7858778Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:28.7858893Z #define _POSIX2_COLL_WEIGHTS_MAX 2 
2025-05-07T19:46:28.7859116Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:46:28.7859235Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:28.7859357Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:46:28.7859468Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:28.7859585Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:28.7859709Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:28.7859880Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:28.7859980Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:28.7860497Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:28.7860632Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:28.7860805Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:28.7860921Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:28.7861024Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:28.7861138Z #define STA_FLL 0x0008 2025-05-07T19:46:28.7861278Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:28.7861380Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:28.7861505Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7861646Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:28.7861733Z #define __stub_revoke 2025-05-07T19:46:28.7861830Z #define __timer_t_defined 1 2025-05-07T19:46:28.7862005Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:28.7862101Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:28.7862211Z #define ULLONG_MAX (LLONG_MAX * 2ULL + 1) 2025-05-07T19:46:28.7862353Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:28.7862458Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:28.7862566Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:28.7862725Z #define cudaArrayTextureGather 0x08 
2025-05-07T19:46:28.7862854Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:28.7863001Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:28.7863103Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:28.7863233Z #define _IO_off_t __off_t 2025-05-07T19:46:28.7863324Z #define __FLT64_DIG__ 15 2025-05-07T19:46:28.7863547Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:28.7863657Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:28.7863817Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7863932Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:28.7864029Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:28.7864159Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:28.7864251Z #define NULL __null 2025-05-07T19:46:28.7864377Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:28.7864487Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:28.7864615Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:28.7864722Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7864815Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:28.7864935Z #define FP_ZERO 2 2025-05-07T19:46:28.7865044Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:28.7865193Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:28.7865300Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7865418Z #define __WCHAR_T__ 2025-05-07T19:46:28.7865521Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:46:28.7865714Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:46:28.7865896Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:28.7865998Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:28.7866118Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:46:28.7866260Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:28.7866392Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 
2025-05-07T19:46:28.7866523Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:28.7866614Z #define _BSD_PTRDIFF_T_ 2025-05-07T19:46:28.7866728Z #define _SIGSET_H_types 1 2025-05-07T19:46:28.7866839Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:28.7866947Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:28.7867114Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:28.7867224Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:28.7867343Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:46:28.7867474Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:28.7867602Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:46:28.7867783Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:28.7867959Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:28.7868078Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:46:28.7868186Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:28.7868290Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:28.7868386Z #define STA_MODE 0x4000 2025-05-07T19:46:28.7868514Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:46:28.7868620Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:28.7868740Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:28.7868868Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:28.7868969Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:28.7869079Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:28.7869196Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:28.7869306Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:28.7869396Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:28.7869521Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7869625Z #define __SEG_FS 1 2025-05-07T19:46:28.7869726Z #define _IO_size_t size_t 2025-05-07T19:46:28.7869829Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:46:28.7869956Z #define INT_MIN 
(-INT_MAX - 1) 2025-05-07T19:46:28.7870042Z #define __stub_lchmod 2025-05-07T19:46:28.7870191Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:46:28.7870307Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7870430Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:28.7870515Z #define __SEG_GS 1 2025-05-07T19:46:28.7870698Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:46:28.7870804Z #define _IOS_APPEND 8 2025-05-07T19:46:28.7870898Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:28.7870999Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:28.7871101Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:28.7871218Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:28.7871319Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:28.7871417Z #define htole16(x) (x) 2025-05-07T19:46:28.7871518Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:28.7871608Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:28.7871695Z #define __INT16_TYPE__ short int 2025-05-07T19:46:28.7871807Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:28.7871915Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:28.7872023Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:28.7872153Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:28.7872239Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:28.7872320Z #define __WCLONE 0x80000000 2025-05-07T19:46:28.7872405Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:46:28.7872495Z #define SEEK_HOLE 4 2025-05-07T19:46:28.7872578Z #define TIMER_ABSTIME 1 2025-05-07T19:46:28.7872667Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:46:28.7872761Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:28.7872925Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:28.7873029Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7873115Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:28.7873229Z #define __cpp_sized_deallocation 201309L 
2025-05-07T19:46:28.7873315Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7873427Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:28.7873530Z #define _LINUX_LIMITS_H 2025-05-07T19:46:28.7873607Z #define linux 1 2025-05-07T19:46:28.7873689Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:28.7873790Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:28.7873894Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:28.7873980Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:28.7874076Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:28.7874229Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:28.7874322Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:28.7874406Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7874569Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:28.7874656Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:28.7874738Z #define htole64(x) (x) 2025-05-07T19:46:28.7874829Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:28.7874962Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:28.7875051Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:28.7875522Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:28.7875620Z #define __USE_POSIX2 1 2025-05-07T19:46:28.7875708Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:28.7875792Z #define __WALL 0x40000000 2025-05-07T19:46:28.7875974Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:28.7876070Z #define _XLOCALE_H 1 2025-05-07T19:46:28.7876160Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:28.7876421Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:46:28.7876526Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:28.7876634Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:28.7876745Z #define __EXCEPTIONS 1 2025-05-07T19:46:28.7876865Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:28.7877099Z #define __launch_bounds__(...) 
__annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:28.7877230Z #define __WORDSIZE 64 2025-05-07T19:46:28.7877336Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:28.7877451Z #define _STL_RELOPS_H 1 2025-05-07T19:46:28.7877619Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:28.7877734Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:28.7877844Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:28.7877960Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:28.7878071Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:28.7878387Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:28.7878660Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:28.7878793Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:28.7878905Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:28.7879042Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:28.7879163Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:28.7879273Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:28.7879396Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:28.7879598Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:28.7879704Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:46:28.7879814Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:28.7879951Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:28.7880139Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:28.7880268Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:46:28.7880353Z #define _STRING_H 1 2025-05-07T19:46:28.7880487Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:28.7880592Z #define _GCC_MAX_ALIGN_T 2025-05-07T19:46:28.7880701Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:28.7880874Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:46:28.7880981Z #define __code_model_small__ 1 2025-05-07T19:46:28.7881080Z #define 
_PSTL_CONFIG_H 2025-05-07T19:46:28.7881194Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:28.7881341Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:28.7881448Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:28.7881563Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:28.7881948Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:28.7882063Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:46:28.7882162Z #define le64toh(x) (x) 2025-05-07T19:46:28.7882288Z #define FILENAME_MAX 4096 2025-05-07T19:46:28.7882444Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:28.7882566Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:28.7882659Z #define L_cuserid 9 2025-05-07T19:46:28.7882776Z #define __ino_t_defined 2025-05-07T19:46:28.7882857Z #define __k8__ 1 2025-05-07T19:46:28.7883015Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:28.7883157Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:46:28.7883253Z #define __int8_t_defined 2025-05-07T19:46:28.7883348Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:28.7883459Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:28.7883605Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:28.7883716Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:28.7883808Z #define _IOS_TRUNC 16 2025-05-07T19:46:28.7883960Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:28.7884122Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:28.7884217Z #define __HAVE_COLUMN 2025-05-07T19:46:28.7884308Z #define __stub_fdetach 2025-05-07T19:46:28.7884768Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 
2025-05-07T19:46:28.7884855Z #define __pic__ 2 2025-05-07T19:46:28.7884977Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7885087Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:28.7885183Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:28.7885285Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:28.7885372Z #define __stub_chflags 2025-05-07T19:46:28.7885474Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:28.7885617Z #define __need_IOV_MAX 2025-05-07T19:46:28.7885726Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:28.7885844Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:28.7885941Z #define __cpp_decltype 200707L 2025-05-07T19:46:28.7886037Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:28.7886125Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:28.7886242Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:28.7886327Z #define TTY_NAME_MAX 32 2025-05-07T19:46:28.7886492Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:28.7886627Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7886801Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:28.7886918Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:28.7887023Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:28.7887113Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:28.7887198Z #define __import__ 2025-05-07T19:46:28.7887289Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:28.7887447Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:28.7887533Z #define __export__ 2025-05-07T19:46:28.7887656Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:28.7887769Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:28.7887936Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:28.7888036Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:28.7888131Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:28.7888239Z #define __FLT64X_MAX_EXP__ 16384 
2025-05-07T19:46:28.7888332Z #define _WCHAR_T_DECLARED 2025-05-07T19:46:28.7888563Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:28.7888689Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:28.7888785Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:28.7888869Z #define WNOWAIT 0x01000000 2025-05-07T19:46:28.7888948Z #define PLOSS 6 2025-05-07T19:46:28.7889045Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:28.7889297Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:28.7889385Z #define EXIT_SUCCESS 0 2025-05-07T19:46:28.7889488Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:28.7889575Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:28.7889669Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:28.7889754Z #define __thread__ __thread 2025-05-07T19:46:28.7889860Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:28.7889947Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:46:28.7890044Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:28.7890273Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:28.7890433Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:28.7890523Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:28.7890603Z #define __linux__ 1 2025-05-07T19:46:28.7890702Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:28.7890821Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:28.7890908Z #define __S16_TYPE short int 2025-05-07T19:46:28.7891260Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:28.7891360Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:28.7891541Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:28.7891647Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:28.7891741Z #define UINT_MAX (INT_MAX * 2U + 1U) 
2025-05-07T19:46:28.7891824Z #define _T_SIZE_ 2025-05-07T19:46:28.7891925Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:28.7892048Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:28.7892138Z #define _PSTL_VERSION 12000 2025-05-07T19:46:28.7892248Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:28.7892364Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:28.7892452Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:28.7892573Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:28.7892720Z #define _IOS_INPUT 1 2025-05-07T19:46:28.7892803Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:28.7892899Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:28.7892985Z #define __INT64_TYPE__ long int 2025-05-07T19:46:28.7893088Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:28.7893180Z #define __shared__ __location__(shared) 2025-05-07T19:46:28.7893262Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:28.7893420Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:28.7893502Z #define __gid_t_defined 2025-05-07T19:46:28.7893607Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:28.7893697Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:28.7893896Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:28.7893985Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:28.7894069Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:28.7894162Z #define ___int_size_t_h 2025-05-07T19:46:28.7894263Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:28.7894377Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:28.7894522Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:28.7894631Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:28.7894719Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:28.7894807Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:28.7894908Z #define __SIZEOF_FLOAT128__ 16 
2025-05-07T19:46:28.7895021Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7895121Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:28.7895231Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:28.7895328Z #define __clock_t_defined 1 2025-05-07T19:46:28.7895426Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:28.7895522Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:28.7895618Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:28.7895704Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:46:28.7895791Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:28.7895901Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:28.7895988Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:28.7896150Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:28.7896222Z #define __SSE__ 1 2025-05-07T19:46:28.7896325Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:28.7896415Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:28.7896492Z #define _CTYPE_H 1 2025-05-07T19:46:28.7896585Z #define __sigset_t_defined 2025-05-07T19:46:28.7896672Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:28.7896760Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:28.7896842Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:28.7896988Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:28.7897076Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:28.7897154Z #define __SM_70_RT_H__ 2025-05-07T19:46:28.7897247Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:28.7897343Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:28.7897430Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:28.7897582Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:28.7897683Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:28.7897788Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:28.7897877Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:28.7897968Z #define _GLIBCXX_TXN_SAFE 
2025-05-07T19:46:28.7898043Z #define __amd64__ 1 2025-05-07T19:46:28.7898126Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:28.7898222Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:28.7898484Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:28.7898571Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:28.7898649Z #define EOF (-1) 2025-05-07T19:46:28.7898749Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:28.7898837Z #define __USE_POSIX199309 1 2025-05-07T19:46:28.7898924Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:28.7899011Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:28.7899097Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:46:28.7899240Z #define LLONG_MIN (-LLONG_MAX-1) 2025-05-07T19:46:28.7899340Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:28.7899438Z #define ____mbstate_t_defined 1 2025-05-07T19:46:28.7899514Z #define STA_NANO 0x2000 2025-05-07T19:46:28.7899603Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:28.7899689Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:28.7899778Z #define _IO_LINKED 0x80 2025-05-07T19:46:28.7899864Z #define __cpp_lib_launder 201606 2025-05-07T19:46:28.7899951Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:28.7900050Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:28.7900133Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:46:28.7900221Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:28.7900349Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:28.7900460Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7900558Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:28.7900649Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:28.7900750Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:28.7900833Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:28.7900954Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:28.7901067Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 
2025-05-07T19:46:28.7901258Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:28.7901431Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:46:28.7901509Z #define __stub_stty 2025-05-07T19:46:28.7901676Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:28.7901754Z #define le16toh(x) (x) 2025-05-07T19:46:28.7901857Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:28.7902034Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:46:28.7902106Z #define _SIZET_ 2025-05-07T19:46:28.7902190Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:28.7902270Z #define _SVID_SOURCE 1 2025-05-07T19:46:28.7902356Z #define _LP64 1 2025-05-07T19:46:28.7902441Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:28.7902664Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:28.7902781Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:28.7902860Z #define __UINT8_C(c) c 2025-05-07T19:46:28.7902944Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:28.7903030Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:28.7903142Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:28.7903228Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:28.7903311Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:46:28.7903414Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:28.7903563Z #define CUDARTAPI 2025-05-07T19:46:28.7903641Z #define IOV_MAX 1024 2025-05-07T19:46:28.7903774Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:28.7903877Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:28.7903967Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:28.7904044Z #define __wchar_t__ 2025-05-07T19:46:28.7904155Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:28.7904234Z #define SEEK_END 2 2025-05-07T19:46:28.7904321Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:28.7904483Z #define 
_GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:28.7904587Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:28.7904721Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:28.7904807Z #define ____FILE_defined 1 2025-05-07T19:46:28.7904951Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:28.7905047Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:46:28.7905128Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:28.7905218Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:28.7905468Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:28.7905595Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:28.7905676Z #define _IO_RIGHT 04 2025-05-07T19:46:28.7905779Z #define __END_NAMESPACE_STD 2025-05-07T19:46:28.7906009Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:28.7906092Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:28.7906208Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:28.7906302Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:28.7906404Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:28.7906481Z #define _STDDEF_H_ 2025-05-07T19:46:28.7906654Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:28.7906747Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7906856Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:28.7907057Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:28.7907161Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:28.7907293Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:28.7907404Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:28.7907504Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:28.7907606Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:28.7907691Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:28.7907809Z 
#define __cpp_namespace_attributes 201411L 2025-05-07T19:46:28.7907904Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:28.7908004Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:28.7908101Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:28.7908262Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:46:28.7908347Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:28.7908516Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:28.7908635Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:28.7908730Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:28.7908871Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:28.7908979Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:46:28.7909072Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:28.7909170Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:28.7909275Z #define P_tmpdir "/tmp" 2025-05-07T19:46:28.7909416Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:28.7909510Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:46:28.7909617Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:28.7909791Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:28.7909954Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:46:28.7910049Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:28.7910175Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:28.7910308Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:28.7910456Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:28.7910684Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:28.7910800Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:28.7910906Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:28.7911001Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:28.7911123Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:28.7911219Z #define __INT_FAST8_WIDTH__ 8 
2025-05-07T19:46:28.7911318Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:28.7911421Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:28.7911534Z #define __FXSR__ 1 2025-05-07T19:46:28.7911613Z #define _SIZE_T 2025-05-07T19:46:28.7911716Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:28.7911847Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:28.7912012Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:28.7912155Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:28.7912257Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:28.7912381Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:28.7912560Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:28.7912746Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:28.7912851Z #define _GXX_NULLPTR_T 2025-05-07T19:46:28.7913020Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:28.7913103Z #define FOPEN_MAX 16 2025-05-07T19:46:28.7913186Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:28.7913322Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:28.7913408Z #define __suseconds_t_defined 2025-05-07T19:46:28.7913496Z #define __off_t_defined 2025-05-07T19:46:28.7913591Z #define stderr stderr 2025-05-07T19:46:28.7913690Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:28.7913799Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:28.7913893Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:28.7913999Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:28.7914411Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:28.7914495Z #define __mode_t_defined 2025-05-07T19:46:28.7914594Z #define _GCC_SIZE_T 2025-05-07T19:46:28.7914689Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:28.7914788Z #define __cpp_runtime_arrays 198712L 
2025-05-07T19:46:28.7914921Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:28.7915022Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:28.7915118Z #define __UINT32_C(c) c ## U 2025-05-07T19:46:28.7915223Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:28.7915351Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:28.7915463Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:28.7915549Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:28.7915651Z #define __size_t__ 2025-05-07T19:46:28.7915781Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:28.7915947Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:28.7916056Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:28.7916226Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:28.7916484Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:28.7916658Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:46:28.7916760Z #define _ENDIAN_H 1 2025-05-07T19:46:28.7916876Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:28.7916975Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:28.7917081Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:28.7917258Z #define __try try 2025-05-07T19:46:28.7917363Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:28.7917457Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:46:28.7917569Z #define __INT8_MAX__ 0x7f 2025-05-07T19:46:28.7917843Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:28.7917948Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:28.7918036Z #define __PIC__ 2 2025-05-07T19:46:28.7918237Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:28.7918367Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:46:28.7918510Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:28.7918635Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:28.7918735Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:28.7918938Z #define 
__FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:28.7919060Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:28.7919172Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:28.7919277Z #define _IO_uid_t __uid_t 2025-05-07T19:46:28.7919392Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:28.7919541Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:28.7919646Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:28.7919797Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:28.7919922Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:28.7920054Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:28.7920146Z #define LONG_BIT 64 2025-05-07T19:46:28.7920258Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:28.7920383Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:28.7920525Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:28.7920629Z #define __fsfilcnt_t_defined 2025-05-07T19:46:28.7920803Z #define __blkcnt_t_defined 2025-05-07T19:46:28.7921083Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:28.7921174Z #define __USE_LARGEFILE 1 2025-05-07T19:46:28.7921274Z #define __cpp_constexpr 201603L 2025-05-07T19:46:28.7921392Z #define CUDART_VERSION 12060 2025-05-07T19:46:28.7921485Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:28.7921585Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:28.7921696Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:28.7921899Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:28.7921992Z #define __lldiv_t_defined 1 2025-05-07T19:46:28.7922086Z #define __SSE2__ 1 2025-05-07T19:46:28.7922188Z #define _IOLBF 1 2025-05-07T19:46:28.7922303Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:28.7922402Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:28.7922528Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:28.7922624Z #define 
_GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:28.7922736Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:28.7922831Z #define __INT32_TYPE__ int 2025-05-07T19:46:28.7922938Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:28.7923047Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:28.7923144Z #define __cpp_exceptions 199711L 2025-05-07T19:46:28.7923268Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:28.7923382Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:28.7923481Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:28.7923604Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:28.7923794Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:46:28.7923892Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:28.7923989Z #define __SWORD_TYPE long int 2025-05-07T19:46:28.7924112Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:28.7924210Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:28.7924312Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:28.7924454Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:28.7924762Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:28.7924876Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:46:28.7925046Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:28.7925173Z #define _T_SIZE 2025-05-07T19:46:28.7925292Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:28.7925437Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:28.7925604Z #define __va_arg_pack() __builtin_va_arg_pack () 2025-05-07T19:46:28.7925713Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:28.7925825Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:28.7926665Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:28.7926812Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:28.7926933Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7927042Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:28.7927274Z #define 
__CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_ ##_FEAT 2025-05-07T19:46:28.7927385Z #define __GNUC_MINOR__ 4 2025-05-07T19:46:28.7927511Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:28.7927625Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:46:28.7927795Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7927897Z #define __PIE__ 2 2025-05-07T19:46:28.7928019Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:28.7928165Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:28.7928383Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:46:28.7928728Z #define __intN_t(N,MODE) typedef int int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:28.7928858Z #define __nlink_t_defined 2025-05-07T19:46:28.7929001Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:28.7929120Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:28.7929217Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:28.7929513Z #define __u_intN_t(N,MODE) typedef unsigned int u_int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:28.7929643Z #define __cpp_template_template_args 201611L 2025-05-07T19:46:28.7929833Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:28.7929968Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:28.7930069Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:28.7930168Z #define __FILE_defined 1 2025-05-07T19:46:28.7930356Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:46:28.7930494Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:28.7930601Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:28.7930720Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:28.7930881Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:28.7930999Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:28.7931112Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:28.7931210Z #define __INT16_C(c) c 
2025-05-07T19:46:28.7931346Z #define __U32_TYPE unsigned int 2025-05-07T19:46:28.7931454Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:28.7931752Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:28.7931882Z #define __STDC__ 1 2025-05-07T19:46:28.7931992Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:28.7932104Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:28.7932240Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:28.7932404Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:28.7932505Z #define __FLT32X_DIG__ 15 2025-05-07T19:46:28.7932619Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:28.7932756Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:28.7932880Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:28.7933004Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:28.7933152Z #define USHRT_MAX (SHRT_MAX * 2 + 1) 2025-05-07T19:46:28.7933267Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:28.7933367Z #define stdin stdin 2025-05-07T19:46:28.7933471Z #define __ino64_t_defined 2025-05-07T19:46:28.7933605Z #define STA_CLK 0x8000 2025-05-07T19:46:28.7933713Z #define __clockid_t_defined 1 2025-05-07T19:46:28.7933875Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:28.7934083Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:28.7934194Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:28.7934313Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:28.7934434Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:28.7934573Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:28.7934786Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:28.7934890Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:28.7935526Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? 
__c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:28.7935622Z #define DOMAIN 1 2025-05-07T19:46:28.7935723Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:28.7935850Z #define __NVCC__ 1 2025-05-07T19:46:28.7935967Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:28.7936091Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:28.7936211Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:28.7936354Z #define __throw_exception_again throw 2025-05-07T19:46:28.7936458Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:28.7936560Z #define __EXCEPTION_H 1 2025-05-07T19:46:28.7936696Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:46:28.7936816Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:28.7937130Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:28.7937256Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:28.7937393Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:28.7937499Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:28.7937616Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:28.7937753Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:28.7937912Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:46:28.7938086Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:28.7938236Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:28.7938343Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:46:28.7938464Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:28.7938573Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:28.7938714Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:28.7938863Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:28.7938972Z #define __useconds_t_defined 2025-05-07T19:46:28.7939125Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:28.7939332Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 
2025-05-07T19:46:28.7939497Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:28.7939601Z #define __SSE_MATH__ 1 2025-05-07T19:46:28.7939735Z #define _IO_wint_t wint_t 2025-05-07T19:46:28.7939847Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:28.7939958Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:28.7940099Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:28.7940227Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:28.7940340Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:28.7940447Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:28.7940573Z #define __USE_ATFILE 1 2025-05-07T19:46:28.7940681Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:28.7940790Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:28.7940918Z #define _GCC_PTRDIFF_T 2025-05-07T19:46:28.7941154Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:28.7941267Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:46:28.7941384Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:28.7941531Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:28.7941655Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:28.7941750Z #define _STDLIB_H 1 2025-05-07T19:46:28.7941941Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:28.7942045Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:46:28.7942153Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:28.7942292Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:28.7942431Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:28.7942532Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:28.7942722Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:28.7942908Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:28.7943025Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:28.7943153Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:28.7943284Z #define __ldiv_t_defined 1 
2025-05-07T19:46:28.7943628Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:28.7943722Z #define ___int_ptrdiff_t_h 2025-05-07T19:46:28.7943895Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:28.7944022Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:28.7944118Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:28.7944232Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:28.7944364Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:28.7944467Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:28.7944562Z #define CUDART_CB 2025-05-07T19:46:28.7944661Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:28.7944811Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:28.7944897Z #define MB_LEN_MAX 16 2025-05-07T19:46:28.7945120Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:28.7945255Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:28.7945384Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:28.7945497Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:28.7945602Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:28.7945792Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:28.7945900Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:28.7945992Z #define _GNU_SOURCE 1 2025-05-07T19:46:28.7946166Z #define __stub_putmsg 2025-05-07T19:46:28.7946250Z #define __CUDACC__ 1 2025-05-07T19:46:28.7946345Z #define __N(msgid) (msgid) 2025-05-07T19:46:28.7946589Z #define __P(args) args 2025-05-07T19:46:28.7947006Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:28.7947109Z #define __cpp_init_captures 201304L 2025-05-07T19:46:28.7947226Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:28.7947352Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:28.7947465Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:28.7947559Z #define 
__WCHAR_T 2025-05-07T19:46:28.7947726Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:28.7947836Z #define __fsblkcnt_t_defined 2025-05-07T19:46:28.7947960Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:28.7948072Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:28.7948080Z 2025-05-07T19:46:28.8223077Z 2025-05-07T19:46:28.8223829Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:28.8223880Z 2025-05-07T19:46:30.6599488Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:30.6599957Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:30.6600302Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:30.6600669Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:30.6601025Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:30.6601278Z 2025-05-07T19:46:30.7348161Z 2025-05-07T19:46:30.7360560Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:30.7361968Z [CHECK] nvidia-smi not found 2025-05-07T19:46:30.7362341Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:30.7460858Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:30.7461495Z . 
$PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:30.7462103Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:30.7462479Z env: 2025-05-07T19:46:30.7462722Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:30.7463024Z BUILD_ENV: build_binary 2025-05-07T19:46:30.7463284Z BUILD_TARGET: default 2025-05-07T19:46:30.7463519Z BUILD_VARIANT: cuda 2025-05-07T19:46:30.7463766Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:30.7464015Z ##[endgroup] 2025-05-07T19:46:31.0892798Z ################################################################################ 2025-05-07T19:46:31.0893405Z # Install PyTorch (PIP) 2025-05-07T19:46:31.0893662Z # 2025-05-07T19:46:31.0911940Z # [2025-05-07T19:46:31.090Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:31.0912716Z ################################################################################ 2025-05-07T19:46:31.0912992Z 2025-05-07T19:46:31.0944388Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:32.0034687Z Channels: 2025-05-07T19:46:32.0035058Z - conda-forge 2025-05-07T19:46:32.0035394Z Platform: linux-64 2025-05-07T19:46:34.9559787Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:36.5543424Z Solving environment: \ | / - done 2025-05-07T19:46:36.8421264Z 2025-05-07T19:46:36.8421640Z ## Package Plan ## 2025-05-07T19:46:36.8422026Z 2025-05-07T19:46:36.8422446Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:36.8422791Z 2025-05-07T19:46:36.8422911Z added / updated specs: 2025-05-07T19:46:36.8423218Z - numpy 2025-05-07T19:46:36.8423351Z 2025-05-07T19:46:36.8423357Z 2025-05-07T19:46:36.8423522Z The following packages will be downloaded: 2025-05-07T19:46:36.8423762Z 2025-05-07T19:46:36.8423934Z package | build 2025-05-07T19:46:36.8424315Z ---------------------------|----------------- 2025-05-07T19:46:36.8424737Z libblas-3.9.0 
|31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:36.8425257Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:36.8426009Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:36.8426517Z numpy-2.2.5 | py310hefbff90_0 7.6 MB conda-forge 2025-05-07T19:46:36.8426973Z ------------------------------------------------------------ 2025-05-07T19:46:36.8427345Z Total: 7.6 MB 2025-05-07T19:46:36.8427572Z 2025-05-07T19:46:36.8427742Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:36.8428060Z 2025-05-07T19:46:36.8428503Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:36.8429334Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:36.8429923Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:36.8430438Z numpy conda-forge/linux-64::numpy-2.2.5-py310hefbff90_0 2025-05-07T19:46:36.8430739Z 2025-05-07T19:46:36.8430770Z 2025-05-07T19:46:36.8430774Z 2025-05-07T19:46:36.8430934Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:46:37.2471248Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:37.2716309Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:37.3719188Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:37.8694847Z numpy-2.2.5 | 7.6 MB | ########## | 100%
2025-05-07T19:46:37.8696164Z  2025-05-07T19:46:37.8696395Z 2025-05-07T19:46:37.8696402Z 2025-05-07T19:46:37.8696640Z  2025-05-07T19:46:37.8696899Z 2025-05-07T19:46:37.8696903Z 2025-05-07T19:46:37.8696906Z 2025-05-07T19:46:37.8698887Z  done 2025-05-07T19:46:37.9710651Z Preparing transaction: | done 2025-05-07T19:46:38.1725271Z Verifying transaction: - \ done 2025-05-07T19:46:38.2739684Z Executing transaction: / done 2025-05-07T19:46:38.3811670Z ################################################################################ 2025-05-07T19:46:38.3812226Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:38.3812566Z # 2025-05-07T19:46:38.3830881Z # [2025-05-07T19:46:38.382Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:38.3831457Z ################################################################################ 2025-05-07T19:46:38.3831707Z 2025-05-07T19:46:38.3848725Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:38.4671836Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:38.4672355Z ################################################################################ 2025-05-07T19:46:38.4672782Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:38.4673135Z # 2025-05-07T19:46:38.4685112Z # [2025-05-07T19:46:38.468Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:38.4685842Z ################################################################################ 2025-05-07T19:46:38.4686084Z 2025-05-07T19:46:38.4709495Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:38.4733814Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:38.4745958Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:46:38.4748961Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:38.4755455Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:38.4762461Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:38.4798343Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:12.0470907Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:12.0472847Z 2025-05-07T19:48:12.0473373Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:12.0473913Z Collecting torch 2025-05-07T19:48:12.0474653Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp310-cp310-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:12.0475460Z Collecting filelock (from torch) 2025-05-07T19:48:12.0476326Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:12.0477339Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (4.13.2) 2025-05-07T19:48:12.0478174Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:12.0478742Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:12.0479719Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 39.4 MB/s eta 0:00:00 2025-05-07T19:48:12.0480134Z Collecting networkx (from torch) 
2025-05-07T19:48:12.0480683Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:12.0481414Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 13.6 MB/s eta 0:00:00 2025-05-07T19:48:12.0482180Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (3.1.6) 2025-05-07T19:48:12.0483020Z Collecting fsspec (from torch) 2025-05-07T19:48:12.0483561Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:12.0484162Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:12.0485327Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:12.0486171Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 54.9 MB/s eta 0:00:00 2025-05-07T19:48:12.0486745Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:12.0487474Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:12.0488297Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 5.6 MB/s eta 0:00:00 2025-05-07T19:48:12.0488737Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:12.0489445Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:12.0490262Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 38.1 MB/s eta 0:00:00 2025-05-07T19:48:12.0490655Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:12.0491367Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:12.0492184Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 41.0 MB/s eta 0:00:00 
2025-05-07T19:48:12.0492573Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:12.0493379Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:12.0494246Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 49.0 MB/s eta 0:00:00 2025-05-07T19:48:12.0494810Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:12.0495529Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:12.0496310Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 72.0 MB/s eta 0:00:00 2025-05-07T19:48:12.0496745Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:12.0497438Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:12.0498251Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 64.5 MB/s eta 0:00:00 2025-05-07T19:48:12.0498701Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:12.0499413Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:12.0500247Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 75.9 MB/s eta 0:00:00 2025-05-07T19:48:12.0500650Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:12.0501396Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:12.0502243Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 76.0 MB/s eta 0:00:00 2025-05-07T19:48:12.0502639Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:12.0503379Z Downloading 
https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:12.0504178Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 61.4 MB/s eta 0:00:00 2025-05-07T19:48:12.0504584Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:12.0505390Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:12.0506187Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:12.0506887Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:12.0507572Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:12.0508479Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:12.0509388Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 59.2 MB/s eta 0:00:00 2025-05-07T19:48:12.0509781Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:12.0510609Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:12.0511435Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:12.0512321Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:12.0513646Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:12.0514534Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:12.0515122Z Downloading 
https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:12.0515874Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 2.7 MB/s eta 0:00:00 2025-05-07T19:48:12.0516881Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:12.0518087Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp310-cp310-manylinux_2_28_x86_64.whl (825.5 MB) 2025-05-07T19:48:12.0519066Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.5/825.5 MB 29.6 MB/s eta 0:00:00 2025-05-07T19:48:12.0519939Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:12.0520877Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 6.7 MB/s eta 0:00:00 2025-05-07T19:48:12.0521720Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:12.0522769Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 63.9 MB/s eta 0:00:00 2025-05-07T19:48:12.0523570Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.4 MB) 2025-05-07T19:48:12.0524474Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.4/153.4 MB 68.1 MB/s eta 0:00:00 2025-05-07T19:48:12.0542038Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:12.0543802Z 2025-05-07T19:48:12.0545692Z Successfully 
installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:12.0548212Z 2025-05-07T19:48:14.1740521Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:14.1741156Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:17.5623598Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:20.9200797Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:20.9202183Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:24.1836640Z True 2025-05-07T19:48:24.1837286Z True 2025-05-07T19:48:24.1837609Z 2025-05-07T19:48:24.2592658Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:24.2668257Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:24.2668947Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:24.2669633Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:24.2670008Z env: 2025-05-07T19:48:24.2670261Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:24.2670620Z BUILD_ENV: build_binary 2025-05-07T19:48:24.2670896Z BUILD_TARGET: default 2025-05-07T19:48:24.2671181Z BUILD_VARIANT: cuda 2025-05-07T19:48:24.2671470Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:24.2671770Z ##[endgroup] 2025-05-07T19:48:24.7041138Z /github/home/miniconda/bin/conda 2025-05-07T19:48:24.7042035Z ################################################################################ 2025-05-07T19:48:24.7042607Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:24.7043004Z # 2025-05-07T19:48:24.7057327Z # [2025-05-07T19:48:24.705Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:24.7058569Z ################################################################################ 2025-05-07T19:48:24.7080059Z 2025-05-07T19:48:24.7080387Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:24.7975042Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:24.7987271Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:24.7988029Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:24.7988483Z 2025-05-07T19:48:24.8872872Z 2025-05-07T19:48:24.8873830Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:24.8894869Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:30.4608758Z Collecting environment information... 
2025-05-07T19:48:30.4610586Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:30.4611583Z Is debug build: False 2025-05-07T19:48:30.4612311Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:30.4613172Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:30.4613701Z 2025-05-07T19:48:30.4614047Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:30.4614779Z GCC version: (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:48:30.4615142Z Clang version: Could not collect 2025-05-07T19:48:30.4615433Z CMake version: version 4.0.2 2025-05-07T19:48:30.4615750Z Libc version: glibc-2.34 2025-05-07T19:48:30.4615909Z 2025-05-07T19:48:30.4616230Z Python version: 3.10.17 | packaged by conda-forge | (main, Apr 10 2025, 22:19:12) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:30.4617275Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:30.4617730Z Is CUDA available: False 2025-05-07T19:48:30.4618053Z CUDA runtime version: 12.6.85 2025-05-07T19:48:30.4618363Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:30.4618685Z GPU models and configuration: Could not collect 2025-05-07T19:48:30.4619065Z Nvidia driver version: Could not collect 2025-05-07T19:48:30.4619410Z cuDNN version: Could not collect 2025-05-07T19:48:30.4619720Z HIP runtime version: N/A 2025-05-07T19:48:30.4620006Z MIOpen runtime version: N/A 2025-05-07T19:48:30.4620275Z Is XNNPACK available: True 2025-05-07T19:48:30.4620471Z 2025-05-07T19:48:30.4620556Z CPU: 2025-05-07T19:48:30.4620776Z Architecture: x86_64 2025-05-07T19:48:30.4621144Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:30.4621540Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:30.4621958Z Byte Order: Little Endian 2025-05-07T19:48:30.4622307Z CPU(s): 96 2025-05-07T19:48:30.4622610Z On-line CPU(s) list: 0-95 2025-05-07T19:48:30.4622960Z Vendor ID: GenuineIntel 2025-05-07T19:48:30.4623551Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:30.4623970Z CPU 
family: 6 2025-05-07T19:48:30.4624270Z Model: 85 2025-05-07T19:48:30.4624596Z Thread(s) per core: 2 2025-05-07T19:48:30.4624899Z Core(s) per socket: 24 2025-05-07T19:48:30.4625224Z Socket(s): 2 2025-05-07T19:48:30.4625540Z Stepping: 7 2025-05-07T19:48:30.4625849Z BogoMIPS: 5999.98 2025-05-07T19:48:30.4628088Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:30.4630331Z Hypervisor vendor: KVM 2025-05-07T19:48:30.4630650Z Virtualization type: full 2025-05-07T19:48:30.4631028Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:30.4631437Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:30.4631819Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:30.4632223Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:30.4632563Z NUMA node(s): 2 2025-05-07T19:48:30.4632899Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:30.4633238Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:30.4633717Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:30.4634293Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:30.4634778Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:30.4635388Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:30.4636267Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:30.4636936Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers 
attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:30.4637571Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:30.4637999Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:30.4638517Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:30.4638919Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:30.4639535Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:30.4640397Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:30.4641091Z Vulnerability Srbds: Not affected 2025-05-07T19:48:30.4641488Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:30.4641773Z 2025-05-07T19:48:30.4641895Z Versions of relevant libraries: 2025-05-07T19:48:30.4642216Z [pip3] numpy==2.2.5 2025-05-07T19:48:30.4642594Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:30.4642926Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:30.4643244Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:30.4643586Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:30.4643903Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:30.4644224Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:30.4644514Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:30.4644840Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:30.4645167Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:30.4645574Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:30.4645905Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:30.4646190Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:30.4646722Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:30.4647212Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:30.4647621Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:30.4648023Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:30.4648580Z [conda] 
cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:30.4649167Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:30.4649744Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:30.4650362Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:30.4650956Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:30.4651504Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4652015Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:30.4652573Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:30.4653137Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:30.4653661Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4654197Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:30.4654712Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4655241Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4655786Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:30.4656313Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:30.4656840Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:30.4657347Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:30.4658018Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4658938Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:30.4659577Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4660241Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:30.4660725Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:30.4661244Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:30.4661739Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4662257Z [conda] 
libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:30.4662786Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:30.4663279Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:30.4663781Z [conda] numpy 2.2.5 py310hefbff90_0 conda-forge 2025-05-07T19:48:30.4664249Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:30.4664781Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:30.4665319Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:30.4665827Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:30.4666350Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:30.4666911Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:30.4667423Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:30.4667916Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:30.4668444Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:30.4668985Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:30.4669479Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:30.4669999Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:30.4670489Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:30.4671003Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:30.4671478Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:30.4671780Z 2025-05-07T19:48:30.5463224Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:30.5463868Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:30.5464470Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:30.5464807Z env: 2025-05-07T19:48:30.5465060Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:30.5465362Z BUILD_ENV: build_binary 2025-05-07T19:48:30.5465618Z BUILD_TARGET: default 2025-05-07T19:48:30.5465872Z BUILD_VARIANT: cuda 2025-05-07T19:48:30.5466243Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:30.5466534Z ##[endgroup] 2025-05-07T19:48:30.9836903Z ################################################################################ 2025-05-07T19:48:30.9837534Z # Install cuDNN 2025-05-07T19:48:30.9837773Z # 2025-05-07T19:48:30.9864092Z # [2025-05-07T19:48:30.985Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:30.9866204Z ################################################################################ 2025-05-07T19:48:30.9866890Z 2025-05-07T19:48:30.9878810Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:31.0836266Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:31.0837176Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:31.0837621Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:31.0837858Z 2025-05-07T19:48:31.0850409Z 2025-05-07T19:48:31.0851503Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:31.0852847Z 2025-05-07T19:48:31.0866471Z 2025-05-07T19:48:31.0891801Z [INSTALL] Downloading cuDNN to /tmp/tmp.UT03GynLkR ... 2025-05-07T19:48:31.0919017Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:38.1776635Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:48:38.1777320Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:38.1777531Z 2025-05-07T19:48:38.1800764Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:38.1801270Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:38.1803418Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:42.7324812Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:42.7944175Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:48:50.2096462Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:48:50.4512358Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:48:50.4886205Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:48:51.0236182Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:48:53.1135916Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:48:53.1136480Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:48:53.1137052Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:48:53.1137686Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:48:53.1138274Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:48:53.1138786Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:48:53.1139319Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:48:53.1139791Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:48:53.1140344Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:48:53.1140783Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 
2025-05-07T19:48:53.1145214Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:48:53.1147476Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:48:53.1148893Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:48:57.6379841Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:48:57.6380468Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:48:57.6997677Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:48:57.6998359Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:04.8919017Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:04.8919711Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:04.8920300Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:04.8920954Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:05.0819502Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:05.0820215Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:05.0820744Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:05.0821294Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:05.1177284Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:05.6440393Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:05.6441401Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:05.6441971Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:05.6442510Z 
cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:05.6443076Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:07.7224112Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:07.7225449Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:07.7226380Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:07.7226914Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:07.7227420Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:07.7227927Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:07.7228535Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:07.7229070Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:07.7229532Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:07.7229995Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:07.7230479Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:07.7230956Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:07.7231449Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:07.7231908Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:07.7232389Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:07.7232830Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:07.7239922Z 2025-05-07T19:49:07.7241037Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:49:07.7241652Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:07.7241978Z 2025-05-07T19:49:07.7257742Z 2025-05-07T19:49:07.7257987Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:07.7258280Z 2025-05-07T19:49:07.7274067Z 2025-05-07T19:49:07.7274569Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:07.7275024Z 2025-05-07T19:49:07.7300329Z 2025-05-07T19:49:07.7302128Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:07.7303313Z 2025-05-07T19:49:08.6675883Z 2025-05-07T19:49:08.6676331Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:08.6677131Z + rm -rf /tmp/tmp.UT03GynLkR 2025-05-07T19:49:08.7355707Z 2025-05-07T19:49:08.7355721Z 2025-05-07T19:49:08.7359849Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:08.7360813Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:08.7361500Z 2025-05-07T19:49:09.1506389Z 2025-05-07T19:49:09.1506835Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:09.1575034Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:09.1575629Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:09.1576223Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:09.1576547Z env: 2025-05-07T19:49:09.1576765Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:09.1577088Z BUILD_ENV: build_binary 2025-05-07T19:49:09.1577326Z BUILD_TARGET: default 2025-05-07T19:49:09.1577565Z BUILD_VARIANT: cuda 2025-05-07T19:49:09.1577796Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:09.1578052Z ##[endgroup] 2025-05-07T19:49:09.6332541Z ################################################################################ 2025-05-07T19:49:09.6332968Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:09.6333246Z # 2025-05-07T19:49:09.6356630Z # [2025-05-07T19:49:09.634Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:09.6357945Z ################################################################################ 2025-05-07T19:49:09.6358608Z 2025-05-07T19:49:09.6377573Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:09.7248690Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:09.7264323Z [BUILD] Running git submodules update ... 
2025-05-07T19:49:09.7282948Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:09.7582198Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:09.7582792Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:09.7583279Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:09.7583712Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:09.7584145Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:09.7584738Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:09.7585180Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:09.7613058Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:09.8037177Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:09.8054909Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:11.8945058Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:11.9116731Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:11.9212758Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:12.0527234Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:12.0557135Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:12.0634368Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:12.0635928Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:12.0637545Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:12.0642115Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:12.0947173Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:12.1007282Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:12.1080327Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:12.1227287Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:12.1257846Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:12.1323593Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:12.1327557Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:12.1331459Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:12.1544330Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:12.1575373Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:12.1759324Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:12.1788880Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:12.2034074Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:12.2098081Z 
Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:12.2240995Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:12.2243543Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:12.2250386Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (2.2.1) 2025-05-07T19:49:12.2368668Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:12.2373082Z Requirement already satisfied: exceptiongroup>=1.0.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (1.2.2) 2025-05-07T19:49:12.2378329Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:12.2399164Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:12.2517584Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:12.2548877Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:12.2617025Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 
2025-05-07T19:49:12.2661269Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:12.2669143Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:12.3066173Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:12.3100152Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:12.3208292Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:12.3298299Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:12.4921243Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 173.6 MB/s eta 0:00:00 2025-05-07T19:49:12.4958811Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:12.5053296Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:12.5118826Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:12.5184519Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:12.5256131Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:12.5345620Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:12.5416053Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:12.7199216Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:13.6454469Z 2025-05-07T19:49:13.6513876Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 
ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:13.6516326Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:13.7842591Z ################################################################################ 2025-05-07T19:49:13.7843509Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:13.7843885Z # 2025-05-07T19:49:13.7859605Z # [2025-05-07T19:49:13.785Z] + install_triton_pip build_binary 2025-05-07T19:49:13.7860870Z ################################################################################ 2025-05-07T19:49:13.7861588Z 2025-05-07T19:49:13.7862264Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:13.7863603Z ################################################################################ 2025-05-07T19:49:13.7864684Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:13.7865667Z # 2025-05-07T19:49:13.7878725Z # [2025-05-07T19:49:13.787Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:13.7880449Z ################################################################################ 2025-05-07T19:49:13.7881128Z 2025-05-07T19:49:13.7896396Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:13.8702690Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:49:13.8703291Z ################################################################################ 2025-05-07T19:49:13.8703821Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:13.8704175Z # 2025-05-07T19:49:13.8723201Z # [2025-05-07T19:49:13.871Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:13.8723769Z ################################################################################ 2025-05-07T19:49:13.8724047Z 2025-05-07T19:49:13.8774969Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:13.8788176Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:13.8789773Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:13.8795222Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:13.8804285Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:13.8831578Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:19.4089581Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:19.4091200Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:19.4093349Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. 
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:19.4094833Z 2025-05-07T19:49:19.4095266Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:19.4095738Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:19.4096616Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:19.4097975Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:19.4099214Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 182.3 MB/s eta 0:00:00 2025-05-07T19:49:19.4099661Z Installing collected packages: pytorch-triton 2025-05-07T19:49:19.4100068Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:19.4100491Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:19.4100982Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:19.4101436Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:19.4101969Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:19.4102257Z 2025-05-07T19:49:21.5121668Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:21.5123475Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:23.5742070Z ################################################################################ 2025-05-07T19:49:23.5743429Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:23.5744592Z ################################################################################ 2025-05-07T19:49:23.5745261Z 2025-05-07T19:49:25.5326593Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:27.5317354Z [CHECK] Python (sub-)package 'skbuild' found ... 
2025-05-07T19:49:27.5318522Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:27.5394225Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:27.5394979Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:27.5395580Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:27.5396016Z env: 2025-05-07T19:49:27.5396406Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:27.5396735Z BUILD_ENV: build_binary 2025-05-07T19:49:27.5396985Z BUILD_TARGET: default 2025-05-07T19:49:27.5397230Z BUILD_VARIANT: cuda 2025-05-07T19:49:27.5397482Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:27.5397761Z ##[endgroup] 2025-05-07T19:49:27.9600927Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:27.9602000Z [BUILD] Extracted build target: default 2025-05-07T19:49:27.9602977Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:29.7445437Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:29.7446228Z 2025-05-07T19:49:29.8026641Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:31.6011317Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:31.6011710Z 2025-05-07T19:49:31.6799478Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:33.4806094Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:33.4806433Z 2025-05-07T19:49:33.5606445Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:35.3554164Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:35.3554447Z 2025-05-07T19:49:35.4320630Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:37.2978018Z [BUILD] Extracted and set Python tag: py310 2025-05-07T19:49:37.2979415Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:37.3227268Z core = 24 2025-05-07T19:49:37.3437542Z sockets = 2 2025-05-07T19:49:37.3437911Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:37.3438292Z 
[CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:37.3438602Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:37.3438938Z + rm -rf dist 2025-05-07T19:49:37.3439085Z 2025-05-07T19:49:37.3452699Z 2025-05-07T19:49:37.3453143Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:37.3453818Z 2025-05-07T19:49:40.5161152Z INFO:root:running clean 2025-05-07T19:49:40.5162086Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:40.5165150Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:40.5168235Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:40.5168684Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:40.5169232Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:40.5169769Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:40.5170354Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:40.5170738Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:40.5171924Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:40.8725102Z 2025-05-07T19:49:40.8725577Z [BUILD] Printing git status ... 2025-05-07T19:49:40.8725954Z + git status 2025-05-07T19:49:40.8726099Z 2025-05-07T19:49:41.2674947Z HEAD detached at pull/4066/merge 2025-05-07T19:49:41.2675700Z Untracked files: 2025-05-07T19:49:41.2676182Z (use "git add <file>..."
to include in what will be committed) 2025-05-07T19:49:41.2676554Z ../build_only/ 2025-05-07T19:49:41.2676792Z ../collect_env.py 2025-05-07T19:49:41.2697708Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:41.2698124Z 2025-05-07T19:49:41.2698646Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:41.2699009Z 2025-05-07T19:49:41.2699174Z + git diff 2025-05-07T19:49:41.2699306Z 2025-05-07T19:49:41.2995497Z 2025-05-07T19:49:41.2995884Z ################################################################################ 2025-05-07T19:49:41.2996367Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:41.2996831Z # 2025-05-07T19:49:41.3014666Z # [2025-05-07T19:49:41.300Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:41.3015451Z ################################################################################ 2025-05-07T19:49:41.3015737Z 2025-05-07T19:49:41.3020466Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:41.3021677Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:49:43.1527038Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:43.1527393Z 2025-05-07T19:49:43.2296931Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:45.0826388Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:45.0826758Z 2025-05-07T19:49:45.1585886Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:47.0177466Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:47.0178213Z 2025-05-07T19:49:47.0788817Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:48.9140263Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:48.9140685Z 2025-05-07T19:49:48.9749632Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:49:50.8527863Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:49:50.8528474Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:49:50.8528863Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:49:50.8529241Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:49:50.8529671Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:49:50.8530130Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:49:50.8530547Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:49:52.7567004Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:49:56.6229532Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:49:56.6230777Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:49:56.6231641Z 2025-05-07T19:49:57.0279320Z 2025-05-07T19:49:57.0280405Z [BUILD] Setting CUDA build args ... 2025-05-07T19:49:58.8703347Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:02.5550177Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:02.5551053Z 2025-05-07T19:50:04.3909817Z 2025-05-07T19:50:04.3910634Z [BUILD] Setting NVCC flags ... 
2025-05-07T19:50:04.3913142Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:04.3915365Z 2025-05-07T19:50:04.7941636Z 2025-05-07T19:50:04.7942573Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:04.7943433Z 2025-05-07T19:50:06.5770737Z -std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:06.5772299Z 2025-05-07T19:50:06.6352017Z 2025-05-07T19:50:06.6352678Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:06.6353696Z + conda run -n build_binary c++ --version 2025-05-07T19:50:06.6354344Z 2025-05-07T19:50:08.4397854Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:50:08.4399045Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:50:08.4400398Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:50:08.4402115Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:50:08.4403132Z 2025-05-07T19:50:08.4403145Z 2025-05-07T19:50:08.4978693Z 2025-05-07T19:50:08.4980321Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:08.4981212Z 2025-05-07T19:50:10.3399159Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:10.3399627Z 2025-05-07T19:50:10.3399800Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:50:10.3402463Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --debug 2025-05-07T19:50:10.3404740Z ################################################################################ 2025-05-07T19:50:10.3405089Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:10.3405410Z # 2025-05-07T19:50:10.3420607Z # [2025-05-07T19:50:10.341Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:10.3421193Z ################################################################################ 2025-05-07T19:50:10.3421472Z 2025-05-07T19:50:10.3421679Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:50:10.3426288Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py310 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:10.3431100Z 2025-05-07T19:50:12.1693920Z * Getting build dependencies for wheel... 
2025-05-07T19:50:13.4868713Z INFO:root:running egg_info 2025-05-07T19:50:13.4894668Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:13.4895969Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:13.4897552Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:13.4900694Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:13.4901318Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:13.4959403Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:13.4961170Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:13.4976891Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:13.4978487Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:13.4980837Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:13.4982048Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:13.4982675Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:13.4983216Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:13.4984064Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:13.4984619Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:13.4985036Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:50:13.4986227Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:13.7739336Z * Building wheel... 2025-05-07T19:50:15.0651289Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-pzkwk5og', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--debug', '--package_channel=nightly', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:15.0655594Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:15.0658439Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-pzkwk5og', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:15.0660449Z 
[SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:15.0661023Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:15.0661575Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:15.0662137Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:15.0662536Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:15.0667039Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:15.0671568Z 2025-05-07T19:50:15.0671573Z 2025-05-07T19:50:15.0671868Z -------------------------------------------------------------------------------- 2025-05-07T19:50:15.0672278Z -- Trying 'Ninja' generator 2025-05-07T19:50:15.0672558Z -------------------------------- 2025-05-07T19:50:15.0672857Z 
--------------------------- 2025-05-07T19:50:15.0673110Z ---------------------- 2025-05-07T19:50:15.0673375Z ----------------- 2025-05-07T19:50:15.0673597Z ------------ 2025-05-07T19:50:15.0673840Z ------- 2025-05-07T19:50:15.0674081Z -- 2025-05-07T19:50:15.1056485Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:15.1057094Z Not searching for unused variables given on the command line. 2025-05-07T19:50:15.1057655Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:15.1058127Z CMake. 2025-05-07T19:50:15.1058256Z 2025-05-07T19:50:15.1058494Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:50:15.1059090Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:50:15.1059634Z to work with policies introduced by <max> or earlier. 2025-05-07T19:50:15.1059895Z 2025-05-07T19:50:15.1059901Z 2025-05-07T19:50:15.1504654Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:15.1579612Z -- Detecting C compiler ABI info 2025-05-07T19:50:15.2447077Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:15.2624706Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:15.2626576Z -- Detecting C compile features 2025-05-07T19:50:15.2627316Z -- Detecting C compile features - done 2025-05-07T19:50:15.3427514Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:15.3513897Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:15.4604910Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:15.4800580Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:15.4803481Z -- Detecting CXX compile features 2025-05-07T19:50:15.4811605Z -- Detecting CXX compile features - done 2025-05-07T19:50:15.4878077Z -- Configuring done (0.4s) 2025-05-07T19:50:15.4923999Z -- Generating done (0.0s)
2025-05-07T19:50:15.4943303Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:15.4993180Z -- 2025-05-07T19:50:15.4993848Z ------- 2025-05-07T19:50:15.4994440Z ------------ 2025-05-07T19:50:15.4995072Z ----------------- 2025-05-07T19:50:15.4995744Z ---------------------- 2025-05-07T19:50:15.4996084Z --------------------------- 2025-05-07T19:50:15.4996410Z -------------------------------- 2025-05-07T19:50:15.4996714Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:15.4997134Z -------------------------------------------------------------------------------- 2025-05-07T19:50:15.4997429Z 2025-05-07T19:50:15.5014005Z Configuring Project 2025-05-07T19:50:15.5014814Z Working directory: 2025-05-07T19:50:15.5015914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:50:15.5017128Z Command: 2025-05-07T19:50:15.5038537Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install -DPYTHON_VERSION_STRING:STRING=3.10.17 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.10.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 
-DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:50:15.5058324Z 2025-05-07T19:50:15.5459956Z 2025-05-07T19:50:15.5460757Z Not searching for unused variables given on the command line. 
2025-05-07T19:50:15.5461164Z 2025-05-07T19:50:15.5461349Z ================================================================================ 2025-05-07T19:50:15.5461714Z Default C compiler flags 2025-05-07T19:50:15.5462200Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:15.5462509Z 2025-05-07T19:50:15.5463165Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib 2025-05-07T19:50:15.5463890Z ================================================================================ 2025-05-07T19:50:15.5464140Z 2025-05-07T19:50:15.5464145Z 2025-05-07T19:50:15.5464149Z 2025-05-07T19:50:15.5464301Z ================================================================================ 2025-05-07T19:50:15.5464658Z Default C++ compiler flags 2025-05-07T19:50:15.5465059Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:15.5465368Z 2025-05-07T19:50:15.5465822Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib 2025-05-07T19:50:15.5466533Z ================================================================================ 2025-05-07T19:50:15.5466771Z 2025-05-07T19:50:15.5466775Z 2025-05-07T19:50:15.5466806Z 2025-05-07T19:50:15.5466932Z ================================================================================ 2025-05-07T19:50:15.5467259Z AVX2_FLAGS: 2025-05-07T19:50:15.5467410Z 2025-05-07T19:50:15.5467500Z -mavx2 2025-05-07T19:50:15.5467712Z -mf16c 2025-05-07T19:50:15.5467940Z -mfma 2025-05-07T19:50:15.5468170Z -fopenmp 2025-05-07T19:50:15.5468409Z ================================================================================ 2025-05-07T19:50:15.5468641Z 2025-05-07T19:50:15.5468645Z 2025-05-07T19:50:15.5468649Z 2025-05-07T19:50:15.5468793Z ================================================================================ 2025-05-07T19:50:15.5469116Z AVX512_FLAGS: 
2025-05-07T19:50:15.5469273Z 2025-05-07T19:50:15.5469364Z -mavx2 2025-05-07T19:50:15.5469569Z -mf16c 2025-05-07T19:50:15.5469803Z -mfma 2025-05-07T19:50:15.5470005Z -mavx512f 2025-05-07T19:50:15.5470242Z -mavx512bw 2025-05-07T19:50:15.5470479Z -mavx512dq 2025-05-07T19:50:15.5470689Z -mavx512vl 2025-05-07T19:50:15.5470924Z -fopenmp 2025-05-07T19:50:15.5471170Z ================================================================================ 2025-05-07T19:50:15.5471505Z 2025-05-07T19:50:15.5471547Z 2025-05-07T19:50:15.5471551Z 2025-05-07T19:50:15.5471762Z ================================================================================ 2025-05-07T19:50:15.5472236Z The project is built using scikit-build 2025-05-07T19:50:15.5472553Z ================================================================================ 2025-05-07T19:50:15.5472768Z 2025-05-07T19:50:15.5472772Z 2025-05-07T19:50:15.5472801Z 2025-05-07T19:50:15.5472914Z ================================================================================ 2025-05-07T19:50:15.5473215Z Build Settings 2025-05-07T19:50:15.5473374Z 2025-05-07T19:50:15.5473479Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:15.5473759Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:15.5473956Z 2025-05-07T19:50:15.5474054Z NVCC_VERBOSE : 2025-05-07T19:50:15.5474328Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:15.5474581Z CUDNN_LIBRARY : 2025-05-07T19:50:15.5475026Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:15.5475484Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:15.5475761Z 8.0 2025-05-07T19:50:15.5475958Z 9.0 2025-05-07T19:50:15.5476261Z 9.0a 2025-05-07T19:50:15.5476369Z 2025-05-07T19:50:15.5476632Z HIP_ROOT_DIR : 2025-05-07T19:50:15.5476930Z HIPCC_VERBOSE : 2025-05-07T19:50:15.5477221Z AMDGPU_TARGETS : 2025-05-07T19:50:15.5477558Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:15.5477873Z ================================================================================ 2025-05-07T19:50:15.5478107Z 
2025-05-07T19:50:15.6249392Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:15.6647051Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:16.5969037Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler GNU 11.4.0 2025-05-07T19:50:16.6065441Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:16.7017095Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:16.7205215Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:16.7207119Z -- Detecting CXX compile features 2025-05-07T19:50:16.7213219Z -- Detecting CXX compile features - done 2025-05-07T19:50:16.7328436Z -- Detecting C compiler ABI info 2025-05-07T19:50:16.8198105Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:16.8382256Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:16.8382911Z -- Detecting C compile features 2025-05-07T19:50:16.8385897Z -- Detecting C compile features - done 2025-05-07T19:50:16.8486889Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:17.7661176Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:17.8224392Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:17.8258157Z -- Detecting CUDA compile features 2025-05-07T19:50:17.8259235Z -- Detecting CUDA compile features - done 2025-05-07T19:50:17.8349733Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:18.1026277Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:18.1026784Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:18.3799948Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:18.3801691Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:18.6399898Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:18.6400384Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:18.9138873Z -- Performing Test C_HAS_AVX2_2 - 
Success 2025-05-07T19:50:18.9139376Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:19.1751096Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:19.1751560Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:19.3962275Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:19.3963767Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:19.6563166Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:19.6563675Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:19.9319932Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:19.9320677Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:20.1917949Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:20.1918354Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:20.4638575Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:20.4639048Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:20.7269430Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:20.7269887Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:20.9469238Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:20.9645209Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:20.9679938Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:20.9758826Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:21.0666807Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed 2025-05-07T19:50:21.0667356Z -- Looking for pthread_create in pthreads 2025-05-07T19:50:21.1468630Z -- Looking for pthread_create in pthreads - not found 2025-05-07T19:50:21.1469120Z -- Looking for pthread_create in pthread 2025-05-07T19:50:21.2388789Z -- Looking for pthread_create in pthread - found 2025-05-07T19:50:21.2395362Z -- Found Threads: TRUE 2025-05-07T19:50:21.3997426Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:21.3998392Z -- PyTorch: CUDA nvcc is: 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:21.3999199Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:21.5234485Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:21.6076299Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.10.17") found components: Interpreter 2025-05-07T19:50:21.6086075Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:21.6086903Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:21.6087290Z Call Stack (most recent call first): 2025-05-07T19:50:21.6087975Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:21.6089082Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:21.6089933Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:21.6090369Z CMakeLists.txt:112 (include) 2025-05-07T19:50:21.6090549Z 2025-05-07T19:50:21.6090555Z 2025-05-07T19:50:21.6090723Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:21.6091176Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:21.6091643Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:21.6092066Z -- USE_CUFILE is set to 0. 
Compiling without cuFile support 2025-05-07T19:50:21.6092963Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:21.6432495Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:21.6433381Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:50:21.6433801Z Call Stack (most recent call first): 2025-05-07T19:50:21.6434639Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:21.6435566Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:21.6436061Z CMakeLists.txt:112 (include) 2025-05-07T19:50:21.6436358Z 2025-05-07T19:50:21.6436387Z 2025-05-07T19:50:21.6437684Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:21.6438448Z 2025-05-07T19:50:21.6438452Z 2025-05-07T19:50:21.6438586Z ================================================================================ 2025-05-07T19:50:21.6438970Z PyTorch Flags: 2025-05-07T19:50:21.6439209Z 2025-05-07T19:50:21.6439438Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:21.6439874Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:21.6440691Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.6441284Z 2025-05-07T19:50:21.6441526Z TORCH_LIBRARIES: 2025-05-07T19:50:21.6441784Z torch 2025-05-07T19:50:21.6441999Z torch_library 2025-05-07T19:50:21.6442484Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.6443183Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
2025-05-07T19:50:21.6443911Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.6444449Z 2025-05-07T19:50:21.6444698Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:21.6444987Z --expt-relaxed-constexpr 2025-05-07T19:50:21.6445273Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:21.6445601Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:21.6445918Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:21.6446244Z ================================================================================ 2025-05-07T19:50:21.6446624Z 2025-05-07T19:50:21.6446629Z 2025-05-07T19:50:21.6446632Z 2025-05-07T19:50:21.6446743Z ================================================================================ 2025-05-07T19:50:21.6447062Z NCCL Flags 2025-05-07T19:50:21.6447180Z 2025-05-07T19:50:21.6447573Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.6448573Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.6449222Z ================================================================================ 2025-05-07T19:50:21.6449446Z 2025-05-07T19:50:21.6449450Z 2025-05-07T19:50:21.6449454Z 2025-05-07T19:50:21.6449564Z ================================================================================ 2025-05-07T19:50:21.6449889Z CUDA Driver Path 2025-05-07T19:50:21.6450026Z 2025-05-07T19:50:21.6450394Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.6450960Z ================================================================================ 2025-05-07T19:50:21.6451198Z 2025-05-07T19:50:21.6451483Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.6469899Z 2025-05-07T19:50:21.6469927Z 2025-05-07T19:50:21.6470180Z 
================================================================================ 2025-05-07T19:50:21.6470673Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:21.6471313Z 2025-05-07T19:50:21.6471617Z CPU_SRCS: 2025-05-07T19:50:21.6471757Z 2025-05-07T19:50:21.6471841Z 2025-05-07T19:50:21.6472048Z GPU_SRCS: 2025-05-07T19:50:21.6472195Z 2025-05-07T19:50:21.6472278Z 2025-05-07T19:50:21.6472482Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.6472653Z 2025-05-07T19:50:21.6472753Z 2025-05-07T19:50:21.6472944Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.6473088Z 2025-05-07T19:50:21.6473183Z 2025-05-07T19:50:21.6473365Z OTHER_SRCS: 2025-05-07T19:50:21.6473762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:21.6474378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:21.6474988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:21.6475611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:21.6476346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:21.6477218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:21.6477802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:21.6478399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:21.6478971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:21.6479588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:21.6480192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:21.6480790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:21.6481393Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:21.6481976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:21.6482573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:21.6483188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:21.6483777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:21.6484378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:21.6484959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:21.6485555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:21.6486135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:21.6486742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:21.6488638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:21.6489259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:21.6489881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:21.6490454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:21.6491062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:21.6491676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:21.6492234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:21.6492803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:21.6493392Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:21.6494012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:21.6494595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:21.6495171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:21.6495749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:21.6496306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:21.6496877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:21.6497435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:21.6498004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:21.6498561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:21.6499135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:21.6499701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:21.6500253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:21.6500898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:21.6501455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:21.6502049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:21.6502645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:21.6503227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:21.6503826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 
2025-05-07T19:50:21.6504424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:21.6505023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:21.6505612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:21.6506222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:21.6506838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:21.6507411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:21.6507993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:21.6508566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:21.6509160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:21.6509734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:21.6510171Z 2025-05-07T19:50:21.6510374Z CC_FLAGS: 2025-05-07T19:50:21.6510491Z 2025-05-07T19:50:21.6510627Z 2025-05-07T19:50:21.6510814Z NVCC_FLAGS: 2025-05-07T19:50:21.6510932Z 2025-05-07T19:50:21.6511092Z 2025-05-07T19:50:21.6511281Z HIPCC_FLAGS: 2025-05-07T19:50:21.6511424Z 2025-05-07T19:50:21.6511504Z 2025-05-07T19:50:21.6511695Z INCLUDE_DIRS: 2025-05-07T19:50:21.6511956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.6512303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.6512628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.6512956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.6513475Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:21.6514282Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.6514923Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.6515373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.6515797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.6516369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.6516903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.6517379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.6517947Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.6518456Z 2025-05-07T19:50:21.6518698Z Selected Source Files: 2025-05-07T19:50:21.6518857Z 2025-05-07T19:50:21.6518937Z 2025-05-07T19:50:21.6519157Z HIPified Source Files: 2025-05-07T19:50:21.6519322Z 2025-05-07T19:50:21.6519412Z 2025-05-07T19:50:21.6519660Z Library Dependencies: 2025-05-07T19:50:21.6519936Z torch 2025-05-07T19:50:21.6520153Z torch_library 2025-05-07T19:50:21.6520633Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:21.6521325Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.6522067Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.6522877Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.6523746Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.6524404Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.6524834Z 2025-05-07T19:50:21.6525078Z Output Library: 2025-05-07T19:50:21.6525326Z asmjit 2025-05-07T19:50:21.6525582Z 2025-05-07T19:50:21.6525812Z Destination Directory: 2025-05-07T19:50:21.6526105Z fbgemm_gpu 2025-05-07T19:50:21.6526362Z 
================================================================================ 2025-05-07T19:50:21.6526634Z 2025-05-07T19:50:21.6526638Z 2025-05-07T19:50:21.6526642Z 2025-05-07T19:50:21.6526767Z ================================================================================ 2025-05-07T19:50:21.6527161Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:21.6527472Z 2025-05-07T19:50:21.6527724Z CPU_SRCS: 2025-05-07T19:50:21.6527850Z 2025-05-07T19:50:21.6527944Z 2025-05-07T19:50:21.6528177Z GPU_SRCS: 2025-05-07T19:50:21.6528309Z 2025-05-07T19:50:21.6528401Z 2025-05-07T19:50:21.6528653Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:21.6528806Z 2025-05-07T19:50:21.6528898Z 2025-05-07T19:50:21.6529155Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:21.6529313Z 2025-05-07T19:50:21.6529442Z 2025-05-07T19:50:21.6529657Z OTHER_SRCS: 2025-05-07T19:50:21.6529987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:21.6530456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:21.6530973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:21.6531408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:21.6531886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:21.6532393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:21.6532977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:21.6533410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:21.6533834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:21.6534325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:21.6534782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:21.6535274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:21.6535735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:21.6536165Z 2025-05-07T19:50:21.6536380Z 
CC_FLAGS: 2025-05-07T19:50:21.6536540Z 2025-05-07T19:50:21.6536635Z 2025-05-07T19:50:21.6536893Z NVCC_FLAGS: 2025-05-07T19:50:21.6537029Z 2025-05-07T19:50:21.6537125Z 2025-05-07T19:50:21.6537380Z HIPCC_FLAGS: 2025-05-07T19:50:21.6537523Z 2025-05-07T19:50:21.6537615Z 2025-05-07T19:50:21.6537855Z INCLUDE_DIRS: 2025-05-07T19:50:21.6538117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.6538481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:21.6538788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:21.6539145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:21.6539659Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:21.6540484Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:21.6541175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:21.6541606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:21.6542079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:21.6542571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:21.6543140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:21.6543645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:21.6544227Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:21.6544788Z 2025-05-07T19:50:21.6545018Z Selected Source Files: 2025-05-07T19:50:21.6545250Z 2025-05-07T19:50:21.6545369Z 2025-05-07T19:50:21.6545593Z HIPified Source Files: 2025-05-07T19:50:21.6545792Z 2025-05-07T19:50:21.6545885Z 2025-05-07T19:50:21.6546111Z Library Dependencies: 2025-05-07T19:50:21.6546593Z torch 2025-05-07T19:50:21.6546820Z torch_library 2025-05-07T19:50:21.6547318Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 
2025-05-07T19:50:21.6548060Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:21.6548784Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:21.6549633Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:21.6550387Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:21.6550912Z asmjit 2025-05-07T19:50:21.6551264Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:21.6551719Z 2025-05-07T19:50:21.6551968Z Output Library: 2025-05-07T19:50:21.6552205Z fbgemm 2025-05-07T19:50:21.6552446Z 2025-05-07T19:50:21.6552665Z Destination Directory: 2025-05-07T19:50:21.6552951Z fbgemm_gpu 2025-05-07T19:50:21.6553207Z ================================================================================ 2025-05-07T19:50:21.6553477Z 2025-05-07T19:50:21.6553482Z 2025-05-07T19:50:21.6553485Z 2025-05-07T19:50:21.6553612Z ================================================================================ 2025-05-07T19:50:21.6553969Z Running code generation script ... 
2025-05-07T19:50:21.6554764Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:21.6555575Z ================================================================================ 2025-05-07T19:50:21.6555947Z 2025-05-07T19:50:22.4223446Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:22.4224485Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:22.4225268Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:22.4225772Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:22.4226313Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4226843Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.4227375Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:22.4227873Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:22.4228399Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:22.4228962Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4229499Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.4230050Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:22.4230578Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.4231141Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4231686Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.4232285Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.4232862Z Written: 
gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4233403Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.4233968Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4234525Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.4235137Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.4235778Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.4236649Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:22.4237136Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:22.4237538Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:22.4238038Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:22.4238589Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4239119Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:22.4239648Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:22.4240177Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4240753Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:22.4241325Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4241894Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.4242512Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4243063Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4243679Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.4244275Z Written: 
gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.4244851Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:22.4245345Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:22.4245773Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:22.4246287Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.4246892Z Written: lookup_adagrad.py 2025-05-07T19:50:22.4247426Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:22.4247861Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:22.4248372Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.4248907Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:22.4249395Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:22.4249938Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4250465Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.4251000Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:22.4251489Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:22.4252003Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:22.4252529Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4253063Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.4253588Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:22.4254111Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.4254669Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4255206Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.4255805Z 
Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.4256372Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4256910Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.4257473Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4258020Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.4258730Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.4259274Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.4259999Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:22.4260445Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:22.4260815Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:22.4261274Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.4261666Z Written: lookup_adam.py 2025-05-07T19:50:22.4261997Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:22.4262417Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.4262892Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:22.4263363Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4263862Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:22.4264345Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:22.4264819Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4265331Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:22.4265799Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4266333Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:22.4266851Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4267365Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4267896Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.4268409Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.4268908Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:22.4269313Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:22.4269795Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:22.4270216Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.4270630Z Written: lookup_lamb.py 2025-05-07T19:50:22.4270956Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:22.4271376Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.4271870Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:22.4272362Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4272890Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:22.4273356Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:22.4273878Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4274417Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:22.4274911Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4275469Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.4276015Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4276790Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4277369Z Written: 
gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.4277987Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.4278565Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:22.4279028Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:22.4279471Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:22.4279953Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.4280411Z Written: lookup_lars_sgd.py 2025-05-07T19:50:22.4280761Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:22.4281262Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.4281841Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:22.4282533Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.4283190Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:22.4283789Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:22.4284457Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.4285099Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:22.4285761Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.4286462Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.4287143Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.4287812Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.4288490Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 
2025-05-07T19:50:22.4289304Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.5617243Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:22.5617872Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:22.5618422Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:22.5619004Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5619540Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:22.5619981Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:22.5620827Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.5621488Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:22.5622125Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.5622788Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:22.5623398Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:22.5624070Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.5624738Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:22.5625411Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.5626125Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.5626846Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.5627505Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.5628228Z Written: 
gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.5628919Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.5629712Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:22.5630339Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:22.5630845Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:22.5631421Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5631893Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:22.5632328Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:22.5632868Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.5633458Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:22.5634117Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:22.5634662Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:22.5635191Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:22.5635719Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.5636395Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.5637144Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.5637756Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.5638362Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:22.5638923Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:22.5639518Z Written: 
gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:22.5640108Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:22.5640703Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:22.5641257Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:22.5641871Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.5642512Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.5643120Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.5643739Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.5644321Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:22.5645026Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:22.5645623Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.5646282Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.5647094Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:22.5647680Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.5648330Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.5648988Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.5649672Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.5650347Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.5650971Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 
2025-05-07T19:50:22.5651599Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.5652214Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.5652859Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.5653461Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:22.5654076Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.5654725Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.5655375Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.5656051Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.5656677Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.5657330Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:22.5657934Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.5658794Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:22.5659445Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:22.5660158Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:22.5660798Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:22.5661404Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:22.5662030Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:22.5662667Z Written: 
gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:22.5663281Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:22.5663886Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:22.5664439Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:22.5665018Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:22.5665580Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:22.5666134Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:22.5666679Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:22.5667152Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:22.5667600Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:22.5668082Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5668618Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:22.5668988Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:22.5669454Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:22.5669979Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5670412Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:22.5670814Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:22.5671266Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:22.5671794Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.5672357Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:22.5672920Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 
2025-05-07T19:50:22.5673435Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:22.5673982Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5674551Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:22.5675100Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.5675748Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:22.5676417Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:22.5677216Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:22.5677934Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.5678610Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:22.5679304Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.5680053Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:22.5680786Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:22.5681645Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:22.5682373Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7206562Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:22.7207406Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7208173Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:22.7208850Z Written: 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:22.7209565Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.7210291Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.7210999Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:22.7211691Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:22.7212389Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:22.7213104Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.7213822Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.7214541Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:22.7215250Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.7216200Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.7216967Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.7217732Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.7218495Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.7219317Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.7220036Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.7220774Z Written: 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.7221502Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.7222238Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.7222912Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:22.7223541Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:22.7224089Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:22.7224728Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7225284Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:22.7225754Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:22.7226394Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7227065Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:22.7227734Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:22.7228366Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:22.7229023Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7229828Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:22.7230485Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7231180Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:22.7231747Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 
2025-05-07T19:50:22.7232302Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:22.7232922Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7233517Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:22.7234140Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7234705Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:22.7235206Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:22.7235686Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.7236293Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:22.7236975Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:22.7237462Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:22.7237980Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:22.7238482Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.7239044Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:22.7239553Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:22.7240159Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.7240713Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.7241245Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.7241836Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:22.7242380Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.7242950Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.7243485Z Written: 
gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.7244050Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.7244641Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:22.7245182Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.7245708Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:22.7246143Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:22.7246704Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:22.7247159Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7247601Z Written: lookup_sgd.py 2025-05-07T19:50:22.7247943Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:22.7248341Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:22.7248809Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7249322Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:22.7249824Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:22.7250255Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:22.7250782Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7251308Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:22.7251800Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7252328Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:22.7252921Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:22.7253460Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:22.7253938Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:22.7254464Z Written: 
gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:22.7254974Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:22.7255506Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:22.7256079Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:22.7256633Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:22.7257183Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:22.7257730Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:22.7258317Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:22.7258928Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:22.7259382Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:22.7259890Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:22.7260309Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:22.7260722Z Written: lookup_none.py 2025-05-07T19:50:22.7261022Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:22.7261472Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:22.7261955Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:22.7262515Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:22.7263160Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:22.7263666Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:22.7264191Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:22.7264682Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:22.7265183Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 
2025-05-07T19:50:22.7265679Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:22.7266236Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:22.7266784Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:22.7267298Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:22.7267811Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:22.7268286Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:22.7268778Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:22.7269230Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:22.7269730Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:22.7270251Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:22.7270744Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:22.7271177Z Written: pt2_arg_utils.h 2025-05-07T19:50:22.7271437Z Written: __init__.py 2025-05-07T19:50:22.7271718Z Written: lookup_args_ssd.py 2025-05-07T19:50:22.7271988Z Written: lookup_args.py 2025-05-07T19:50:22.7327853Z 2025-05-07T19:50:22.7327860Z 2025-05-07T19:50:22.7328120Z ================================================================================ 2025-05-07T19:50:22.7328584Z Running code generation script ... 
2025-05-07T19:50:22.7423513Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:22.7424362Z ================================================================================ 2025-05-07T19:50:22.7424803Z 2025-05-07T19:50:22.8400244Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:22.8401154Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:22.8401944Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:22.8402475Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:22.8402963Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:22.8403502Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:22.8404013Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:22.8404415Z Written: optimizer_args.py 2025-05-07T19:50:22.8485393Z 2025-05-07T19:50:22.8485399Z 2025-05-07T19:50:22.8485677Z ================================================================================ 2025-05-07T19:50:22.8486111Z Running code generation script ... 
2025-05-07T19:50:22.8486949Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:22.8487755Z ================================================================================ 2025-05-07T19:50:22.8488022Z 2025-05-07T19:50:22.9952557Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:22.9953475Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:22.9954370Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:22.9955093Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:22.9956011Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:22.9956840Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:22.9957548Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:22.9958256Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:22.9958990Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:22.9959727Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:22.9960489Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:22.9961222Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:22.9961989Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:22.9962748Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:22.9963476Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:22.9964199Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:22.9964892Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:22.9965612Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:22.9966329Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:22.9967019Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:22.9967713Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:22.9968495Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:22.9969145Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:22.9969824Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:22.9970338Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:23.0038345Z 2025-05-07T19:50:23.0038352Z 2025-05-07T19:50:23.0038620Z ================================================================================ 2025-05-07T19:50:23.0039054Z Running code generation script ... 
2025-05-07T19:50:23.0039863Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:23.0040656Z ================================================================================ 2025-05-07T19:50:23.0040925Z 2025-05-07T19:50:23.3906034Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:23.3906965Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:23.3907770Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3908303Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:23.3908812Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3909355Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:23.3909850Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3910384Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3910872Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:23.3911360Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:23.3911904Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3912697Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:23.3913217Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:23.3913749Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:23.3914272Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:23.3914841Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:23.3915374Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:50:23.3915943Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:23.3916599Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3917109Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:23.3917659Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3918187Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:23.3918722Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3919242Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3919767Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:23.3920279Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:23.3920784Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3921344Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:23.3921854Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:23.3922376Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:23.3922912Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:23.3923359Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:23.3923847Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:23.3924332Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:23.3924820Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:23.3925419Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:23.3925896Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:23.3926337Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:23.3926791Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:23.3927280Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:23.3927769Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:23.3928286Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:23.3928858Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:23.3929311Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:23.3929726Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:23.3930184Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:23.3930663Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:23.3931111Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:23.3931599Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:23.3932034Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:23.3932496Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:23.3932965Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:23.3933497Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:23.3934017Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:23.3934510Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:23.3935063Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.3935488Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:23.3935929Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:23.4004126Z 2025-05-07T19:50:23.4004181Z 2025-05-07T19:50:23.4004475Z ================================================================================ 2025-05-07T19:50:23.4004897Z Running code 
generation script ... 2025-05-07T19:50:23.4005688Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:23.4006466Z ================================================================================ 2025-05-07T19:50:23.4006741Z 2025-05-07T19:50:23.6899462Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:23.6900424Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:23.6901135Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:23.6901622Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:23.6902095Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:23.6902594Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:23.6903085Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:23.6903555Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:23.6904101Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:23.6904619Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:23.6905119Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:23.6995656Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:23.7005338Z 2025-05-07T19:50:23.7005349Z 2025-05-07T19:50:23.7005722Z ================================================================================ 2025-05-07T19:50:23.7006211Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:23.7006827Z 2025-05-07T19:50:23.7007068Z CPU_SRCS: 2025-05-07T19:50:23.7007502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:23.7008205Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:23.7008892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:23.7009500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:23.7010149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:23.7010637Z 2025-05-07T19:50:23.7010870Z GPU_SRCS: 2025-05-07T19:50:23.7011232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:23.7011858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:23.7012495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:23.7013174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:23.7013815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:23.7014409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:23.7015065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:23.7015661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:23.7016274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:23.7016957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:23.7017445Z 2025-05-07T19:50:23.7017685Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:23.7017839Z 2025-05-07T19:50:23.7018048Z 2025-05-07T19:50:23.7018297Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:23.7018448Z 2025-05-07T19:50:23.7018537Z 2025-05-07T19:50:23.7018781Z OTHER_SRCS: 2025-05-07T19:50:23.7018916Z 2025-05-07T19:50:23.7019009Z 2025-05-07T19:50:23.7019241Z CC_FLAGS: 2025-05-07T19:50:23.7019363Z 
2025-05-07T19:50:23.7019456Z 2025-05-07T19:50:23.7019689Z NVCC_FLAGS: 2025-05-07T19:50:23.7019956Z --expt-relaxed-constexpr 2025-05-07T19:50:23.7020256Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:23.7020595Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:23.7020910Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:23.7021214Z 2025-05-07T19:50:23.7021427Z HIPCC_FLAGS: 2025-05-07T19:50:23.7021587Z 2025-05-07T19:50:23.7021681Z 2025-05-07T19:50:23.7021897Z INCLUDE_DIRS: 2025-05-07T19:50:23.7022189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.7022530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:23.7022858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:23.7023230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.7023745Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:23.7024604Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:23.7025266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:23.7025725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:23.7026195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:23.7026680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:23.7027250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:23.7027730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:23.7028328Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:23.7028852Z 2025-05-07T19:50:23.7029089Z Selected Source Files: 2025-05-07T19:50:23.7029535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:23.7030237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:23.7031006Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:23.7031623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:23.7032274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:23.7032911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:23.7033541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:23.7034204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:23.7034853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:23.7035499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:23.7036094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:23.7036849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:23.7037446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:23.7038059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:23.7038739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:23.7039221Z 2025-05-07T19:50:23.7039473Z HIPified Source Files: 2025-05-07T19:50:23.7039637Z 2025-05-07T19:50:23.7039730Z 2025-05-07T19:50:23.7039972Z Library Dependencies: 2025-05-07T19:50:23.7040227Z torch 2025-05-07T19:50:23.7040470Z torch_library 2025-05-07T19:50:23.7040927Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:23.7041729Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:23.7042475Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:23.7043284Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:23.7044062Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:23.7044676Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:23.7045125Z 2025-05-07T19:50:23.7045339Z Output Library: 2025-05-07T19:50:23.7045613Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:23.7045882Z 2025-05-07T19:50:23.7046097Z Destination Directory: 2025-05-07T19:50:23.7046378Z fbgemm_gpu 2025-05-07T19:50:23.7046790Z ================================================================================ 2025-05-07T19:50:23.7047034Z 2025-05-07T19:50:23.7524366Z 2025-05-07T19:50:23.7524380Z 2025-05-07T19:50:23.7524693Z ================================================================================ 2025-05-07T19:50:23.7525174Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:23.7525597Z 2025-05-07T19:50:23.7525839Z CPU_SRCS: 2025-05-07T19:50:23.7526165Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:23.7526665Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:23.7527163Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:23.7527520Z 2025-05-07T19:50:23.7527760Z GPU_SRCS: 2025-05-07T19:50:23.7528053Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:23.7528550Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:23.7529111Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7529760Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7530384Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7531023Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.7531871Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.7532488Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.7533162Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7533883Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7534576Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7535267Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:23.7535935Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:23.7536629Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:23.7537276Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7537940Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7538566Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7539217Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.7539869Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.7540497Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.7541119Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:23.7541714Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:23.7542446Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 
2025-05-07T19:50:23.7542887Z 2025-05-07T19:50:23.7543136Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:23.7543293Z 2025-05-07T19:50:23.7543414Z 2025-05-07T19:50:23.7543637Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:23.7543789Z 2025-05-07T19:50:23.7543917Z 2025-05-07T19:50:23.7544131Z OTHER_SRCS: 2025-05-07T19:50:23.7544293Z 2025-05-07T19:50:23.7544386Z 2025-05-07T19:50:23.7544589Z CC_FLAGS: 2025-05-07T19:50:23.7544739Z 2025-05-07T19:50:23.7544829Z 2025-05-07T19:50:23.7545056Z NVCC_FLAGS: 2025-05-07T19:50:23.7545296Z --expt-relaxed-constexpr 2025-05-07T19:50:23.7545614Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:23.7545915Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:23.7546252Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:23.7546703Z 2025-05-07T19:50:23.7546909Z HIPCC_FLAGS: 2025-05-07T19:50:23.7547043Z 2025-05-07T19:50:23.7547233Z 2025-05-07T19:50:23.7547444Z INCLUDE_DIRS: 2025-05-07T19:50:23.7547738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.7548074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:23.7548403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:23.7548733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.7549284Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:23.7550121Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:23.7550793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:23.7551255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:23.7551698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:23.7552215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:23.7552749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:23.7553256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:23.7553855Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:23.7554376Z 2025-05-07T19:50:23.7554622Z Selected Source Files: 2025-05-07T19:50:23.7554977Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:23.7555576Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:23.7556039Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:23.7556590Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:23.7557106Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:23.7557706Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7558371Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7558987Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7559593Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.7560211Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.7560818Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.7561459Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7562116Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7562780Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7563425Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:23.7564093Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:23.7564754Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:23.7565383Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:23.7566112Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:23.7566727Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:23.7567354Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:23.7567962Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:23.7568588Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:23.7569183Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:23.7569753Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:23.7570350Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:23.7570758Z 2025-05-07T19:50:23.7570963Z HIPified Source Files: 2025-05-07T19:50:23.7571119Z 2025-05-07T19:50:23.7571196Z 2025-05-07T19:50:23.7571405Z Library Dependencies: 2025-05-07T19:50:23.7571645Z torch 2025-05-07T19:50:23.7571835Z torch_library 2025-05-07T19:50:23.7572277Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:23.7572947Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:23.7573643Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:23.7574426Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:23.7575165Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:23.7575645Z asmjit 2025-05-07T19:50:23.7575838Z fbgemm 2025-05-07T19:50:23.7576049Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:23.7576286Z fbgemm_gpu_config 
2025-05-07T19:50:23.7576653Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:23.7577043Z 2025-05-07T19:50:23.7577247Z Output Library: 2025-05-07T19:50:23.7577477Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:23.7577721Z 2025-05-07T19:50:23.7577917Z Destination Directory: 2025-05-07T19:50:23.7578228Z fbgemm_gpu 2025-05-07T19:50:23.7578455Z ================================================================================ 2025-05-07T19:50:23.7578698Z 2025-05-07T19:50:23.9963245Z 2025-05-07T19:50:23.9963435Z 2025-05-07T19:50:23.9963799Z ================================================================================ 2025-05-07T19:50:23.9964266Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:23.9964653Z 2025-05-07T19:50:23.9964905Z CPU_SRCS: 2025-05-07T19:50:23.9965150Z src/config/feature_gates.cpp 2025-05-07T19:50:23.9965455Z 2025-05-07T19:50:23.9965660Z GPU_SRCS: 2025-05-07T19:50:23.9965789Z 2025-05-07T19:50:23.9965908Z 2025-05-07T19:50:23.9966125Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:23.9966309Z 2025-05-07T19:50:23.9966404Z 2025-05-07T19:50:23.9966614Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:23.9966791Z 2025-05-07T19:50:23.9966909Z 2025-05-07T19:50:23.9967104Z OTHER_SRCS: 2025-05-07T19:50:23.9967260Z 2025-05-07T19:50:23.9967355Z 2025-05-07T19:50:23.9967612Z CC_FLAGS: 2025-05-07T19:50:23.9967742Z 2025-05-07T19:50:23.9967834Z 2025-05-07T19:50:23.9968074Z NVCC_FLAGS: 2025-05-07T19:50:23.9968207Z 2025-05-07T19:50:23.9968317Z 2025-05-07T19:50:23.9968554Z HIPCC_FLAGS: 2025-05-07T19:50:23.9968691Z 2025-05-07T19:50:23.9968784Z 2025-05-07T19:50:23.9969023Z INCLUDE_DIRS: 2025-05-07T19:50:23.9969308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.9969652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:23.9969983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:23.9970316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.9970857Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:23.9971665Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:23.9972573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:23.9973000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:23.9973454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:23.9973935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:23.9974445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:23.9974913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:23.9975460Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:23.9975969Z 2025-05-07T19:50:23.9976166Z Selected Source Files: 2025-05-07T19:50:23.9976436Z src/config/feature_gates.cpp 2025-05-07T19:50:23.9976703Z 2025-05-07T19:50:23.9976900Z HIPified Source Files: 2025-05-07T19:50:23.9977052Z 2025-05-07T19:50:23.9977148Z 2025-05-07T19:50:23.9977338Z Library Dependencies: 2025-05-07T19:50:23.9977579Z torch 2025-05-07T19:50:23.9977774Z torch_library 2025-05-07T19:50:23.9978220Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:23.9978895Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:23.9979592Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:23.9980388Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:23.9981111Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:23.9981718Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:23.9982112Z 
2025-05-07T19:50:23.9982314Z Output Library: 2025-05-07T19:50:23.9982539Z fbgemm_gpu_config 2025-05-07T19:50:23.9982763Z 2025-05-07T19:50:23.9982957Z Destination Directory: 2025-05-07T19:50:23.9983204Z fbgemm_gpu 2025-05-07T19:50:23.9983435Z ================================================================================ 2025-05-07T19:50:23.9983679Z 2025-05-07T19:50:23.9983716Z 2025-05-07T19:50:23.9983720Z 2025-05-07T19:50:23.9983849Z ================================================================================ 2025-05-07T19:50:23.9984352Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:23.9984699Z 2025-05-07T19:50:23.9984888Z CPU_SRCS: 2025-05-07T19:50:23.9985232Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:23.9985744Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:23.9986121Z 2025-05-07T19:50:23.9986360Z GPU_SRCS: 2025-05-07T19:50:23.9986653Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:23.9987107Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:23.9987516Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:23.9987943Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:23.9988366Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:23.9988774Z 2025-05-07T19:50:23.9989031Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:23.9989189Z 2025-05-07T19:50:23.9989289Z 2025-05-07T19:50:23.9989556Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:23.9989713Z 2025-05-07T19:50:23.9989811Z 2025-05-07T19:50:23.9990060Z OTHER_SRCS: 2025-05-07T19:50:23.9990197Z 2025-05-07T19:50:23.9990292Z 2025-05-07T19:50:23.9990535Z CC_FLAGS: 2025-05-07T19:50:23.9990663Z 2025-05-07T19:50:23.9990755Z 2025-05-07T19:50:23.9990997Z NVCC_FLAGS: 2025-05-07T19:50:23.9991133Z 2025-05-07T19:50:23.9991235Z 2025-05-07T19:50:23.9991482Z HIPCC_FLAGS: 2025-05-07T19:50:23.9991625Z 2025-05-07T19:50:23.9991748Z 2025-05-07T19:50:23.9991970Z INCLUDE_DIRS: 
2025-05-07T19:50:23.9992263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.9992601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:23.9992934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:23.9993263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:23.9993863Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:23.9994666Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:23.9995358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:23.9995817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:23.9996341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:23.9996866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:23.9997399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:23.9997905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:23.9998483Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:23.9999035Z 2025-05-07T19:50:23.9999288Z Selected Source Files: 2025-05-07T19:50:23.9999638Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:24.0000138Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:24.0000597Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:24.0001055Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:24.0001466Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:24.0001887Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:24.0002307Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:24.0002697Z 2025-05-07T19:50:24.0002946Z HIPified Source Files: 2025-05-07T19:50:24.0003112Z 2025-05-07T19:50:24.0003205Z 2025-05-07T19:50:24.0003453Z 
Library Dependencies: 2025-05-07T19:50:24.0003709Z torch 2025-05-07T19:50:24.0003955Z torch_library 2025-05-07T19:50:24.0004411Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0005128Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0005841Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0006673Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0007538Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0008160Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0008613Z 2025-05-07T19:50:24.0008831Z Output Library: 2025-05-07T19:50:24.0009111Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0009362Z 2025-05-07T19:50:24.0009619Z Destination Directory: 2025-05-07T19:50:24.0009885Z fbgemm_gpu 2025-05-07T19:50:24.0010166Z ================================================================================ 2025-05-07T19:50:24.0010407Z 2025-05-07T19:50:24.0010411Z 2025-05-07T19:50:24.0010416Z 2025-05-07T19:50:24.0010567Z ================================================================================ 2025-05-07T19:50:24.0010997Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:24.0011412Z 2025-05-07T19:50:24.0011625Z CPU_SRCS: 2025-05-07T19:50:24.0011902Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:24.0012214Z 2025-05-07T19:50:24.0012449Z GPU_SRCS: 2025-05-07T19:50:24.0012693Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:24.0013026Z 2025-05-07T19:50:24.0013268Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0013421Z 2025-05-07T19:50:24.0013517Z 2025-05-07T19:50:24.0013752Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0013903Z 2025-05-07T19:50:24.0013993Z 2025-05-07T19:50:24.0014220Z OTHER_SRCS: 
2025-05-07T19:50:24.0014349Z 2025-05-07T19:50:24.0014445Z 2025-05-07T19:50:24.0014683Z CC_FLAGS: 2025-05-07T19:50:24.0014807Z 2025-05-07T19:50:24.0014897Z 2025-05-07T19:50:24.0015131Z NVCC_FLAGS: 2025-05-07T19:50:24.0015261Z 2025-05-07T19:50:24.0015377Z 2025-05-07T19:50:24.0015581Z HIPCC_FLAGS: 2025-05-07T19:50:24.0015719Z 2025-05-07T19:50:24.0015836Z 2025-05-07T19:50:24.0016041Z INCLUDE_DIRS: 2025-05-07T19:50:24.0016395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0016737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0017082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0017419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0017962Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0018767Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0019462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0019924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0020365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0020887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0021423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0021935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0022512Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0023070Z 2025-05-07T19:50:24.0023325Z Selected Source Files: 2025-05-07T19:50:24.0023616Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:24.0023988Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:24.0024296Z 2025-05-07T19:50:24.0024534Z HIPified Source Files: 2025-05-07T19:50:24.0024698Z 2025-05-07T19:50:24.0024792Z 2025-05-07T19:50:24.0025040Z Library Dependencies: 
2025-05-07T19:50:24.0025306Z torch 2025-05-07T19:50:24.0025553Z torch_library 2025-05-07T19:50:24.0026008Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0026738Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0027485Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0028300Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0029082Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0029645Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0030060Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0030487Z 2025-05-07T19:50:24.0030742Z Output Library: 2025-05-07T19:50:24.0031039Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.0031324Z 2025-05-07T19:50:24.0031576Z Destination Directory: 2025-05-07T19:50:24.0031843Z fbgemm_gpu 2025-05-07T19:50:24.0032115Z ================================================================================ 2025-05-07T19:50:24.0032355Z 2025-05-07T19:50:24.0032458Z 2025-05-07T19:50:24.0032462Z 2025-05-07T19:50:24.0032587Z ================================================================================ 2025-05-07T19:50:24.0033013Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:24.0033399Z 2025-05-07T19:50:24.0033613Z CPU_SRCS: 2025-05-07T19:50:24.0033921Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:24.0034368Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:24.0034814Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:24.0035142Z 2025-05-07T19:50:24.0035376Z GPU_SRCS: 2025-05-07T19:50:24.0035633Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:24.0036022Z codegen/utils/embedding_bounds_check_v2.cu 
2025-05-07T19:50:24.0036484Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:24.0036859Z 2025-05-07T19:50:24.0037186Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0037345Z 2025-05-07T19:50:24.0037440Z 2025-05-07T19:50:24.0037698Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0037857Z 2025-05-07T19:50:24.0037955Z 2025-05-07T19:50:24.0038204Z OTHER_SRCS: 2025-05-07T19:50:24.0038340Z 2025-05-07T19:50:24.0038436Z 2025-05-07T19:50:24.0038677Z CC_FLAGS: 2025-05-07T19:50:24.0038807Z 2025-05-07T19:50:24.0038901Z 2025-05-07T19:50:24.0039227Z NVCC_FLAGS: 2025-05-07T19:50:24.0039479Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0039820Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0040176Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0040504Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0040824Z 2025-05-07T19:50:24.0041046Z HIPCC_FLAGS: 2025-05-07T19:50:24.0041190Z 2025-05-07T19:50:24.0041311Z 2025-05-07T19:50:24.0041531Z INCLUDE_DIRS: 2025-05-07T19:50:24.0041817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0042151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0042483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0042813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0043352Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0044174Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0044843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0045297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0045756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0046280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0046993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0047516Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0048128Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0048648Z 2025-05-07T19:50:24.0048908Z Selected Source Files: 2025-05-07T19:50:24.0049233Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:24.0049706Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:24.0050128Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:24.0050528Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:24.0050901Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:24.0051286Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:24.0051635Z 2025-05-07T19:50:24.0051969Z HIPified Source Files: 2025-05-07T19:50:24.0052138Z 2025-05-07T19:50:24.0052261Z 2025-05-07T19:50:24.0052477Z Library Dependencies: 2025-05-07T19:50:24.0052763Z torch 2025-05-07T19:50:24.0052979Z torch_library 2025-05-07T19:50:24.0053455Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0054147Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0054887Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0055731Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0056489Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0057019Z fbgemm 2025-05-07T19:50:24.0057254Z fbgemm_gpu_config 2025-05-07T19:50:24.0057662Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0058085Z 2025-05-07T19:50:24.0058323Z Output Library: 2025-05-07T19:50:24.0058576Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.0058854Z 2025-05-07T19:50:24.0059108Z Destination Directory: 2025-05-07T19:50:24.0059369Z fbgemm_gpu 
2025-05-07T19:50:24.0059667Z ================================================================================ 2025-05-07T19:50:24.0059909Z 2025-05-07T19:50:24.0059913Z 2025-05-07T19:50:24.0059917Z 2025-05-07T19:50:24.0060045Z ================================================================================ 2025-05-07T19:50:24.0060491Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:24.0060872Z 2025-05-07T19:50:24.0061111Z CPU_SRCS: 2025-05-07T19:50:24.0061239Z 2025-05-07T19:50:24.0061357Z 2025-05-07T19:50:24.0061566Z GPU_SRCS: 2025-05-07T19:50:24.0061867Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:24.0062381Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:24.0062852Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:24.0063223Z 2025-05-07T19:50:24.0063480Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0063637Z 2025-05-07T19:50:24.0063737Z 2025-05-07T19:50:24.0063995Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0064151Z 2025-05-07T19:50:24.0064248Z 2025-05-07T19:50:24.0064494Z OTHER_SRCS: 2025-05-07T19:50:24.0064631Z 2025-05-07T19:50:24.0064758Z 2025-05-07T19:50:24.0064966Z CC_FLAGS: 2025-05-07T19:50:24.0065097Z 2025-05-07T19:50:24.0065225Z 2025-05-07T19:50:24.0065440Z NVCC_FLAGS: 2025-05-07T19:50:24.0065720Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0066017Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0066360Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0066686Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0067006Z 2025-05-07T19:50:24.0067224Z HIPCC_FLAGS: 2025-05-07T19:50:24.0067384Z 2025-05-07T19:50:24.0067484Z 2025-05-07T19:50:24.0067713Z INCLUDE_DIRS: 2025-05-07T19:50:24.0067962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0068323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0068623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0068972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0069479Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0070300Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0070958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0071409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0071875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0072364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0072928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0073417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0074029Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0074618Z 2025-05-07T19:50:24.0074862Z Selected Source Files: 2025-05-07T19:50:24.0075174Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:24.0075621Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:24.0076083Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:24.0076517Z 2025-05-07T19:50:24.0076769Z HIPified Source Files: 2025-05-07T19:50:24.0076939Z 2025-05-07T19:50:24.0077035Z 2025-05-07T19:50:24.0077293Z Library Dependencies: 2025-05-07T19:50:24.0077551Z torch 2025-05-07T19:50:24.0077812Z torch_library 2025-05-07T19:50:24.0078268Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0078990Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0079738Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0080554Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0081325Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0081939Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0082392Z 2025-05-07T19:50:24.0082599Z Output Library: 2025-05-07T19:50:24.0082873Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:24.0083133Z 2025-05-07T19:50:24.0083371Z Destination Directory: 2025-05-07T19:50:24.0083649Z fbgemm_gpu 2025-05-07T19:50:24.0083905Z ================================================================================ 2025-05-07T19:50:24.0084145Z 2025-05-07T19:50:24.0084149Z 2025-05-07T19:50:24.0084177Z 2025-05-07T19:50:24.0084297Z ================================================================================ 2025-05-07T19:50:24.0084793Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:24.0085209Z 2025-05-07T19:50:24.0085417Z CPU_SRCS: 2025-05-07T19:50:24.0085722Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0086099Z 2025-05-07T19:50:24.0086306Z GPU_SRCS: 2025-05-07T19:50:24.0086593Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:24.0086975Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:24.0087370Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:24.0087771Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0088229Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0088657Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0089087Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:24.0089514Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:24.0089896Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:24.0090323Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0090744Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0091205Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 
2025-05-07T19:50:24.0091639Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.0092093Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0092520Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0092979Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.0093446Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.0093867Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0094294Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0094705Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.0095150Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.0095591Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0096069Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0096589Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0096991Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:24.0097444Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0097908Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0098389Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:24.0098820Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0099276Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0099697Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.0100160Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0100633Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0101053Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0101513Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0101985Z 
gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0102476Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0102889Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:24.0103359Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0103866Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0104330Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:24.0104795Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0105235Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0105705Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.0106151Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0106682Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0107114Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0107584Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0108099Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0108565Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0109023Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0109409Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0109771Z 2025-05-07T19:50:24.0109987Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0110166Z 2025-05-07T19:50:24.0110257Z 2025-05-07T19:50:24.0110472Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0110647Z 2025-05-07T19:50:24.0110735Z 2025-05-07T19:50:24.0110974Z OTHER_SRCS: 2025-05-07T19:50:24.0111102Z 2025-05-07T19:50:24.0111193Z 2025-05-07T19:50:24.0111421Z CC_FLAGS: 2025-05-07T19:50:24.0111546Z 2025-05-07T19:50:24.0111639Z 2025-05-07T19:50:24.0111871Z NVCC_FLAGS: 2025-05-07T19:50:24.0112122Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0112451Z 
-D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0112758Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0113114Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0113393Z 2025-05-07T19:50:24.0113624Z HIPCC_FLAGS: 2025-05-07T19:50:24.0113762Z 2025-05-07T19:50:24.0113881Z 2025-05-07T19:50:24.0114088Z INCLUDE_DIRS: 2025-05-07T19:50:24.0114365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0114704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0115027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0115358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0115897Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0116776Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0117487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0117961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0118418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0119059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0119608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0120120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0120697Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0121250Z 2025-05-07T19:50:24.0121505Z Selected Source Files: 2025-05-07T19:50:24.0121820Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0122254Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0122680Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.0123141Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0123569Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 
2025-05-07T19:50:24.0124015Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:24.0124425Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:24.0124893Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0125368Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0125816Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0126301Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:24.0126728Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0127138Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0127515Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:24.0127914Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:24.0128285Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:24.0128700Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0129244Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0129669Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:24.0130106Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:24.0130493Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:24.0130910Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:24.0131312Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0131767Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.0132219Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0132641Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.0133090Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:24.0133503Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:24.0133964Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0134392Z 
gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0134822Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:24.0135245Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0135731Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0136202Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:24.0136625Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0137076Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0137478Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.0137896Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0138282Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:24.0138698Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0139136Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0139527Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:24.0139959Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0140419Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:24.0140935Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:24.0141354Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0141790Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0142199Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:24.0142639Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:24.0143055Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:24.0143476Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0143948Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0144394Z 
gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:24.0144764Z 2025-05-07T19:50:24.0144962Z HIPified Source Files: 2025-05-07T19:50:24.0145136Z 2025-05-07T19:50:24.0145220Z 2025-05-07T19:50:24.0145420Z Library Dependencies: 2025-05-07T19:50:24.0145662Z torch 2025-05-07T19:50:24.0145879Z torch_library 2025-05-07T19:50:24.0146313Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0147140Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0147826Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0148623Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0149351Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0149837Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.0150209Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0150602Z 2025-05-07T19:50:24.0150805Z Output Library: 2025-05-07T19:50:24.0151133Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:24.0151410Z 2025-05-07T19:50:24.0151605Z Destination Directory: 2025-05-07T19:50:24.0151861Z fbgemm_gpu 2025-05-07T19:50:24.0152091Z ================================================================================ 2025-05-07T19:50:24.0152333Z 2025-05-07T19:50:24.0152514Z 2025-05-07T19:50:24.0152518Z 2025-05-07T19:50:24.0152637Z ================================================================================ 2025-05-07T19:50:24.0153083Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:24.0153476Z 2025-05-07T19:50:24.0153678Z CPU_SRCS: 2025-05-07T19:50:24.0153915Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0154302Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0154670Z 
gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0155014Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:24.0155343Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.0155699Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0156098Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:24.0156612Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.0157017Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:24.0157428Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.0157879Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0158293Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0158804Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.0159376Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.0159933Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.0160452Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0160878Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0161330Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0161797Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0162353Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0162808Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0163224Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0163653Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0164150Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0164696Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0165161Z 
gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0165665Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0166204Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0166704Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0167318Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0167993Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0168661Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0169259Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0169715Z 2025-05-07T19:50:24.0169957Z GPU_SRCS: 2025-05-07T19:50:24.0170254Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0170761Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0171232Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0171690Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0173285Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0173767Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0174278Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0174882Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0175417Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0175950Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0176535Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0177069Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 
2025-05-07T19:50:24.0177715Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0178398Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0205563Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0206367Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0206984Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0207431Z 2025-05-07T19:50:24.0207663Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0207829Z 2025-05-07T19:50:24.0207958Z 2025-05-07T19:50:24.0208180Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0208362Z 2025-05-07T19:50:24.0208455Z 2025-05-07T19:50:24.0208680Z OTHER_SRCS: 2025-05-07T19:50:24.0208848Z 2025-05-07T19:50:24.0208943Z 2025-05-07T19:50:24.0209141Z CC_FLAGS: 2025-05-07T19:50:24.0209304Z 2025-05-07T19:50:24.0209399Z 2025-05-07T19:50:24.0209648Z NVCC_FLAGS: 2025-05-07T19:50:24.0209897Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0210235Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0210548Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0210907Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0211192Z 2025-05-07T19:50:24.0211438Z HIPCC_FLAGS: 2025-05-07T19:50:24.0211584Z 2025-05-07T19:50:24.0211677Z 2025-05-07T19:50:24.0211930Z INCLUDE_DIRS: 2025-05-07T19:50:24.0212201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0212719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0213030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0213394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0213942Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0214756Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0215458Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0215891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0216373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0216877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0217452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0217959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0218545Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0219100Z 2025-05-07T19:50:24.0219325Z Selected Source Files: 2025-05-07T19:50:24.0219665Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0220068Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0220495Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0220856Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:24.0221245Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.0221646Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0222069Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:24.0222567Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:24.0222983Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:24.0223511Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.0223982Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:24.0224458Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0225013Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:24.0225585Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.0226208Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:24.0226735Z 
gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0227226Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:24.0227659Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0228166Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0228661Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0229093Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0229555Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0229991Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0230531Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0231085Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0231612Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0232133Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0232716Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0233274Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0233895Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0234610Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0235286Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0236011Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:24.0236633Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0237155Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0237665Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0238104Z 
gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0238571Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0239016Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0239557Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0240119Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0240666Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0241224Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0241779Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0242333Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0242948Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0243662Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0244334Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0244976Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0245551Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:24.0245948Z 2025-05-07T19:50:24.0246194Z HIPified Source Files: 2025-05-07T19:50:24.0246365Z 2025-05-07T19:50:24.0246582Z 2025-05-07T19:50:24.0246947Z Library Dependencies: 2025-05-07T19:50:24.0247205Z torch 2025-05-07T19:50:24.0247458Z torch_library 2025-05-07T19:50:24.0247918Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0248641Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0249379Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:24.0250190Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0250968Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0251435Z fbgemm 2025-05-07T19:50:24.0251669Z fbgemm_gpu_config 2025-05-07T19:50:24.0251913Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.0252199Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.0252460Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0252757Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.0253205Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0253624Z 2025-05-07T19:50:24.0253856Z Output Library: 2025-05-07T19:50:24.0254129Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:24.0254457Z 2025-05-07T19:50:24.0254678Z Destination Directory: 2025-05-07T19:50:24.0254964Z fbgemm_gpu 2025-05-07T19:50:24.0255213Z ================================================================================ 2025-05-07T19:50:24.0255477Z 2025-05-07T19:50:24.0255481Z 2025-05-07T19:50:24.0255487Z 2025-05-07T19:50:24.0255608Z ================================================================================ 2025-05-07T19:50:24.0256073Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:24.0256445Z 2025-05-07T19:50:24.0256678Z CPU_SRCS: 2025-05-07T19:50:24.0257016Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:24.0257491Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:24.0257857Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:24.0258256Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:24.0258645Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:24.0259225Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:24.0259601Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:24.0259960Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:24.0260386Z 
gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:24.0260826Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:24.0261243Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:24.0261664Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:24.0262127Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:24.0262539Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:24.0263066Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:24.0263660Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:24.0264225Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:24.0264764Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:24.0265184Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:24.0265583Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:24.0265958Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:24.0266288Z 2025-05-07T19:50:24.0266513Z GPU_SRCS: 2025-05-07T19:50:24.0266785Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:24.0267241Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0267697Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0268165Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0268611Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:24.0269179Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:24.0269685Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:24.0270234Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0270810Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:24.0271381Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0271939Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:24.0272439Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0272992Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0273468Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:24.0273934Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0274426Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0274902Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0275424Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0275951Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0276547Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:24.0277181Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0277741Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0278266Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:24.0278770Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0279335Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0279883Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0280478Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0281087Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0281684Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:24.0282309Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 
2025-05-07T19:50:24.0282859Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0283368Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:24.0283788Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0284256Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0284700Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0285217Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0285753Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0286220Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:24.0286674Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0287133Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0287608Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:24.0288047Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0288511Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0289054Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0289527Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0290025Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0290458Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:24.0290888Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0291317Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0291746Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:24.0292219Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0292666Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0293131Z 
gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0293578Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0294086Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0294522Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:24.0294962Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0295398Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0295846Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:24.0296296Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0296746Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0297234Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0297721Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0298242Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0298691Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:24.0299148Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0299628Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0300095Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:24.0300619Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0301160Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0301716Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0302280Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0302884Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0303455Z 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:24.0304022Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0304593Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0305112Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:24.0305651Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0306186Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0306743Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0307326Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0307909Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0308480Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:24.0308996Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0309555Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0310016Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:24.0310431Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0310866Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0311283Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0311747Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0312213Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0312670Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:24.0313122Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0313580Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:24.0314086Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:24.0314648Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0315249Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0315834Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0316539Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0317386Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0318141Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:24.0318779Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0319422Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0319910Z 2025-05-07T19:50:24.0320130Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0320313Z 2025-05-07T19:50:24.0320407Z 2025-05-07T19:50:24.0320615Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0320993Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:24.0321492Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:24.0321989Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:24.0322389Z 2025-05-07T19:50:24.0322584Z OTHER_SRCS: 2025-05-07T19:50:24.0322719Z 2025-05-07T19:50:24.0322837Z 2025-05-07T19:50:24.0323037Z CC_FLAGS: 2025-05-07T19:50:24.0323163Z 2025-05-07T19:50:24.0323276Z 2025-05-07T19:50:24.0323475Z NVCC_FLAGS: 2025-05-07T19:50:24.0323739Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0324026Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0324350Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0324668Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0324978Z 
2025-05-07T19:50:24.0325189Z HIPCC_FLAGS: 2025-05-07T19:50:24.0325348Z 2025-05-07T19:50:24.0325514Z 2025-05-07T19:50:24.0325749Z INCLUDE_DIRS: 2025-05-07T19:50:24.0325995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0326354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0326656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0327010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0327517Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0328337Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0329110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0329505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0329945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0330399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0330916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0331361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0331857Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0331997Z 2025-05-07T19:50:24.0332095Z Selected Source Files: 2025-05-07T19:50:24.0332303Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:24.0332421Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:24.0332545Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:24.0332689Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:24.0332812Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:24.0332925Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:24.0333038Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:24.0333225Z 
gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:24.0333389Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:24.0333547Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:24.0333662Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:24.0333861Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:24.0333992Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:24.0334160Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:24.0334390Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:24.0334623Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:24.0334831Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:24.0335021Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:24.0335144Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:24.0335294Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:24.0335409Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:24.0335565Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:24.0335730Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0335895Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0336066Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:24.0336233Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:24.0336424Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:24.0336633Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:24.0336826Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0337050Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0337273Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0337460Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:24.0337713Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0337913Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0338076Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:24.0338242Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0338413Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0338599Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0338795Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0338994Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0339158Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:24.0339336Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0339517Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0339690Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:24.0339900Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0340098Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0340300Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0340536Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0340762Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0340944Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:24.0341159Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0341366Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0341551Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:24.0341705Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0341879Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0342035Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0342217Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0342415Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0342554Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:24.0342719Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0342899Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0343034Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:24.0343190Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0343347Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0343528Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0343713Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0343906Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0344067Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:24.0344230Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0344395Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0344542Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:24.0344699Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0344858Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 
2025-05-07T19:50:24.0345017Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0345216Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0345406Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0345550Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:24.0345725Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0345961Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0346111Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:24.0346295Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0346601Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0346951Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0347246Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0347468Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0347628Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:24.0347820Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0348027Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0348232Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:24.0348471Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0348729Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0348967Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0349233Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0349521Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 
2025-05-07T19:50:24.0349730Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:24.0349965Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0350207Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0350522Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:24.0350751Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0350989Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0351252Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0351512Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0351774Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0352020Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:24.0352254Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0352492Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0352668Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:24.0352839Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0353004Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0353181Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0353390Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0353585Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0353734Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:24.0353926Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0354108Z 
gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0354333Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:24.0354607Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0354863Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0355129Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0355408Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0355790Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0356023Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:24.0356340Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0356629Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0356725Z 2025-05-07T19:50:24.0356824Z HIPified Source Files: 2025-05-07T19:50:24.0356830Z 2025-05-07T19:50:24.0356936Z 2025-05-07T19:50:24.0357043Z Library Dependencies: 2025-05-07T19:50:24.0357132Z torch 2025-05-07T19:50:24.0357216Z torch_library 2025-05-07T19:50:24.0357559Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0357823Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0358165Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0358533Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0358820Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0358914Z fbgemm 2025-05-07T19:50:24.0359028Z 
fbgemm_gpu_config 2025-05-07T19:50:24.0359127Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.0359226Z fbgemm_gpu_tbe_common 2025-05-07T19:50:24.0359309Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0359434Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.0359653Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0359732Z 2025-05-07T19:50:24.0359833Z Output Library: 2025-05-07T19:50:24.0360034Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:24.0360111Z 2025-05-07T19:50:24.0360207Z Destination Directory: 2025-05-07T19:50:24.0360301Z fbgemm_gpu 2025-05-07T19:50:24.0360417Z ================================================================================ 2025-05-07T19:50:24.0360422Z 2025-05-07T19:50:24.0360426Z 2025-05-07T19:50:24.0360430Z 2025-05-07T19:50:24.0360533Z ================================================================================ 2025-05-07T19:50:24.0360750Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:24.0360827Z 2025-05-07T19:50:24.0360909Z CPU_SRCS: 2025-05-07T19:50:24.0360914Z 2025-05-07T19:50:24.0361011Z 2025-05-07T19:50:24.0361095Z GPU_SRCS: 2025-05-07T19:50:24.0361287Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:24.0361502Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:24.0361735Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:24.0361936Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:24.0362156Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:24.0362401Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:24.0362607Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:24.0362830Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:24.0363072Z 
gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:24.0363287Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:24.0363522Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:24.0363770Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:24.0363849Z 2025-05-07T19:50:24.0363937Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0363945Z 2025-05-07T19:50:24.0364022Z 2025-05-07T19:50:24.0364121Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0364125Z 2025-05-07T19:50:24.0364202Z 2025-05-07T19:50:24.0364336Z OTHER_SRCS: 2025-05-07T19:50:24.0364340Z 2025-05-07T19:50:24.0364422Z 2025-05-07T19:50:24.0364503Z CC_FLAGS: 2025-05-07T19:50:24.0364507Z 2025-05-07T19:50:24.0364586Z 2025-05-07T19:50:24.0364678Z NVCC_FLAGS: 2025-05-07T19:50:24.0364778Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0364880Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0364984Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0365092Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0365169Z 2025-05-07T19:50:24.0365256Z HIPCC_FLAGS: 2025-05-07T19:50:24.0365260Z 2025-05-07T19:50:24.0365347Z 2025-05-07T19:50:24.0365432Z INCLUDE_DIRS: 2025-05-07T19:50:24.0365544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0365643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0365757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0365860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0366140Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0366544Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0366689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0366852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0367009Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0367225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0367424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0367567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0367884Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0367961Z 2025-05-07T19:50:24.0368053Z Selected Source Files: 2025-05-07T19:50:24.0368292Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:24.0368617Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:24.0368827Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:24.0369014Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:24.0369228Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:24.0369434Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:24.0369625Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:24.0369850Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:24.0370065Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:24.0370262Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:24.0370489Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:24.0370709Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:24.0370785Z 2025-05-07T19:50:24.0370880Z HIPified Source Files: 2025-05-07T19:50:24.0370885Z 2025-05-07T19:50:24.0370957Z 2025-05-07T19:50:24.0371037Z Library Dependencies: 2025-05-07T19:50:24.0371124Z torch 
2025-05-07T19:50:24.0371198Z torch_library 2025-05-07T19:50:24.0371481Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0371713Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0372025Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0372341Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0372591Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0372704Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:24.0372895Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0373009Z 2025-05-07T19:50:24.0373099Z Output Library: 2025-05-07T19:50:24.0373197Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:24.0373265Z 2025-05-07T19:50:24.0373351Z Destination Directory: 2025-05-07T19:50:24.0373444Z fbgemm_gpu 2025-05-07T19:50:24.0373549Z ================================================================================ 2025-05-07T19:50:24.0373554Z 2025-05-07T19:50:24.0373558Z 2025-05-07T19:50:24.0373561Z 2025-05-07T19:50:24.0373660Z ================================================================================ 2025-05-07T19:50:24.0373856Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:24.0373930Z 2025-05-07T19:50:24.0374002Z CPU_SRCS: 2025-05-07T19:50:24.0374006Z 2025-05-07T19:50:24.0374086Z 2025-05-07T19:50:24.0374170Z GPU_SRCS: 2025-05-07T19:50:24.0374351Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0374527Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0374727Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0374908Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0375131Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0375371Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0375514Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0375659Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0375812Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0375970Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0376164Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0376313Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0376500Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0376700Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0376902Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0377076Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0377266Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0377456Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0377647Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0377850Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0378054Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0378238Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0378438Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0378639Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:50:24.0378861Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0379106Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0379346Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0379572Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0379826Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0380071Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0380211Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0380381Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0380600Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0380740Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0380918Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0381085Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0381224Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0381384Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0381558Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0381703Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0381874Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0382061Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0382200Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0382358Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0382525Z 
gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0382669Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0382835Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0383007Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0383085Z 2025-05-07T19:50:24.0383170Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0383174Z 2025-05-07T19:50:24.0383246Z 2025-05-07T19:50:24.0383333Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0383337Z 2025-05-07T19:50:24.0383405Z 2025-05-07T19:50:24.0383480Z OTHER_SRCS: 2025-05-07T19:50:24.0383486Z 2025-05-07T19:50:24.0383555Z 2025-05-07T19:50:24.0383638Z CC_FLAGS: 2025-05-07T19:50:24.0383641Z 2025-05-07T19:50:24.0383709Z 2025-05-07T19:50:24.0383783Z NVCC_FLAGS: 2025-05-07T19:50:24.0383928Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0384022Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0384118Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0384213Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0384290Z 2025-05-07T19:50:24.0384367Z HIPCC_FLAGS: 2025-05-07T19:50:24.0384371Z 2025-05-07T19:50:24.0384442Z 2025-05-07T19:50:24.0384524Z INCLUDE_DIRS: 2025-05-07T19:50:24.0384623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0384711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0384816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0384913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0385168Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0385528Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0385667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0385813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0385960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
2025-05-07T19:50:24.0386157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0386341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0386474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0386766Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0386834Z 2025-05-07T19:50:24.0386920Z Selected Source Files: 2025-05-07T19:50:24.0387097Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0387275Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0387462Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0387637Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0387869Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0388098Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0388603Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0388780Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0388934Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0389101Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0389248Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:24.0389401Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:24.0389575Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0389778Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0390002Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0390179Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 
2025-05-07T19:50:24.0390371Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0390579Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0390758Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0390966Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0391173Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0391355Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0391554Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0391755Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0391987Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0392268Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0392512Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0392752Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0392994Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0393240Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0393390Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0393545Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0393703Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0393841Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0394028Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0394204Z 
gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0394342Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0394519Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0394697Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0394845Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0395040Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0395221Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0395369Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:24.0395530Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0395708Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0395859Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:24.0396040Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:24.0396313Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:24.0396445Z 2025-05-07T19:50:24.0396542Z HIPified Source Files: 2025-05-07T19:50:24.0396546Z 2025-05-07T19:50:24.0396811Z 2025-05-07T19:50:24.0396908Z Library Dependencies: 2025-05-07T19:50:24.0396994Z torch 2025-05-07T19:50:24.0397089Z torch_library 2025-05-07T19:50:24.0397432Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0397683Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0398014Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0398382Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0398647Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:24.0398764Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:24.0399000Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0399084Z 2025-05-07T19:50:24.0399176Z Output Library: 2025-05-07T19:50:24.0399292Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:24.0399390Z 2025-05-07T19:50:24.0399486Z Destination Directory: 2025-05-07T19:50:24.0399573Z fbgemm_gpu 2025-05-07T19:50:24.0399709Z ================================================================================ 2025-05-07T19:50:24.0399714Z 2025-05-07T19:50:24.0399718Z 2025-05-07T19:50:24.0399721Z 2025-05-07T19:50:24.0399833Z ================================================================================ 2025-05-07T19:50:24.0400050Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:24.0400160Z 2025-05-07T19:50:24.0400243Z CPU_SRCS: 2025-05-07T19:50:24.0400247Z 2025-05-07T19:50:24.0400327Z 2025-05-07T19:50:24.0400423Z GPU_SRCS: 2025-05-07T19:50:24.0400648Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:24.0400813Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:24.0400991Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0401200Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0401384Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0401575Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:24.0401807Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0402019Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0402182Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:24.0402349Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:24.0402572Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0402764Z 
gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0402896Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:24.0403016Z 2025-05-07T19:50:24.0403122Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0403127Z 2025-05-07T19:50:24.0403225Z 2025-05-07T19:50:24.0403359Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0403364Z 2025-05-07T19:50:24.0403455Z 2025-05-07T19:50:24.0403551Z OTHER_SRCS: 2025-05-07T19:50:24.0403556Z 2025-05-07T19:50:24.0403649Z 2025-05-07T19:50:24.0403780Z CC_FLAGS: 2025-05-07T19:50:24.0403784Z 2025-05-07T19:50:24.0403877Z 2025-05-07T19:50:24.0403974Z NVCC_FLAGS: 2025-05-07T19:50:24.0404123Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0404238Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0404361Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0404482Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0404618Z 2025-05-07T19:50:24.0404720Z HIPCC_FLAGS: 2025-05-07T19:50:24.0404724Z 2025-05-07T19:50:24.0404822Z 2025-05-07T19:50:24.0404954Z INCLUDE_DIRS: 2025-05-07T19:50:24.0405081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0405199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0405314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0405468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0405811Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0406209Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0406404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0406586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0406761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0407015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0407233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
2025-05-07T19:50:24.0407396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0407723Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0407855Z 2025-05-07T19:50:24.0407963Z Selected Source Files: 2025-05-07T19:50:24.0408142Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:24.0408370Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:24.0408542Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:24.0408673Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:24.0408962Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:24.0409132Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:24.0409304Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:24.0409481Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:24.0409703Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:24.0409903Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:24.0410115Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:24.0410325Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:24.0410511Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:24.0410604Z 2025-05-07T19:50:24.0410711Z HIPified Source Files: 2025-05-07T19:50:24.0410741Z 2025-05-07T19:50:24.0410824Z 2025-05-07T19:50:24.0410928Z Library Dependencies: 2025-05-07T19:50:24.0411019Z torch 2025-05-07T19:50:24.0411138Z torch_library 2025-05-07T19:50:24.0411431Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0411676Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0412017Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0412351Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0412615Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0412728Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:24.0412964Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0413050Z 2025-05-07T19:50:24.0413144Z Output Library: 2025-05-07T19:50:24.0413283Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:24.0413369Z 2025-05-07T19:50:24.0413467Z Destination Directory: 2025-05-07T19:50:24.0413751Z fbgemm_gpu 2025-05-07T19:50:24.0413870Z ================================================================================ 2025-05-07T19:50:24.0413874Z 2025-05-07T19:50:24.0413878Z 2025-05-07T19:50:24.0413882Z 2025-05-07T19:50:24.0413998Z ================================================================================ 2025-05-07T19:50:24.0414256Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:24.0414345Z 2025-05-07T19:50:24.0414439Z CPU_SRCS: 2025-05-07T19:50:24.0414443Z 2025-05-07T19:50:24.0414537Z 2025-05-07T19:50:24.0414662Z GPU_SRCS: 2025-05-07T19:50:24.0414784Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:24.0414974Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:24.0415121Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:24.0415235Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:24.0415346Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:24.0415470Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:24.0415658Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:24.0415814Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:24.0415930Z gen_embedding_backward_split_none.cpp 
2025-05-07T19:50:24.0416140Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:24.0416269Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:24.0416427Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:24.0416661Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:24.0416881Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:24.0417081Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:24.0417247Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:24.0417412Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:24.0417566Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:24.0417733Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:24.0417944Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:24.0418132Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:24.0418280Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:24.0418458Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:24.0418605Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:24.0418795Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:24.0418939Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:24.0419121Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:24.0419279Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:24.0419447Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:24.0419675Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:24.0419885Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:24.0420086Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:24.0420317Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:24.0420461Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:24.0420614Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:24.0420843Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:24.0421102Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:24.0421193Z 2025-05-07T19:50:24.0421291Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0421295Z 2025-05-07T19:50:24.0421405Z 2025-05-07T19:50:24.0421501Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0421505Z 2025-05-07T19:50:24.0421591Z 2025-05-07T19:50:24.0421706Z OTHER_SRCS: 2025-05-07T19:50:24.0421710Z 2025-05-07T19:50:24.0421794Z 2025-05-07T19:50:24.0421882Z CC_FLAGS: 2025-05-07T19:50:24.0421885Z 2025-05-07T19:50:24.0421969Z 2025-05-07T19:50:24.0422082Z NVCC_FLAGS: 2025-05-07T19:50:24.0422188Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0422294Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0422431Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0422538Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0422624Z 2025-05-07T19:50:24.0422717Z HIPCC_FLAGS: 2025-05-07T19:50:24.0422721Z 2025-05-07T19:50:24.0422835Z 2025-05-07T19:50:24.0422929Z INCLUDE_DIRS: 2025-05-07T19:50:24.0423045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0423176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0423333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0423448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0423721Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0424119Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0424266Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0424429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0424614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0424816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0425015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0425198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0425497Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0425591Z 2025-05-07T19:50:24.0425697Z Selected Source Files: 2025-05-07T19:50:24.0425841Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:24.0425979Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:24.0426091Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:24.0426230Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:24.0426341Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:24.0426464Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:24.0426644Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:24.0426795Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:24.0426913Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:24.0427099Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:24.0427295Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:24.0427455Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:24.0427662Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:24.0427909Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:24.0428103Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:24.0428269Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 
2025-05-07T19:50:24.0428430Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:24.0428583Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:24.0428746Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:24.0428928Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:24.0429146Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:24.0429286Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:24.0429440Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:24.0429613Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:24.0429768Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:24.0429911Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:24.0430095Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:24.0430252Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:24.0430424Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:24.0430626Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:24.0430866Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:24.0431069Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:24.0431277Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:24.0431454Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:24.0431609Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:24.0431839Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:24.0432140Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:24.0432233Z 2025-05-07T19:50:24.0432332Z HIPified Source Files: 2025-05-07T19:50:24.0432336Z 
2025-05-07T19:50:24.0432425Z 2025-05-07T19:50:24.0432549Z Library Dependencies: 2025-05-07T19:50:24.0432639Z torch 2025-05-07T19:50:24.0432733Z torch_library 2025-05-07T19:50:24.0433061Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0433313Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0433631Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0433973Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0434267Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0434365Z fbgemm_gpu_config 2025-05-07T19:50:24.0434464Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0434704Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0434788Z 2025-05-07T19:50:24.0434883Z Output Library: 2025-05-07T19:50:24.0435023Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:24.0435107Z 2025-05-07T19:50:24.0435203Z Destination Directory: 2025-05-07T19:50:24.0435295Z fbgemm_gpu 2025-05-07T19:50:24.0435428Z ================================================================================ 2025-05-07T19:50:24.0435432Z 2025-05-07T19:50:24.0435435Z 2025-05-07T19:50:24.0435439Z 2025-05-07T19:50:24.0435550Z ================================================================================ 2025-05-07T19:50:24.0435758Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:24.0435857Z 2025-05-07T19:50:24.0435943Z CPU_SRCS: 2025-05-07T19:50:24.0436212Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:24.0436441Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:24.0436520Z 2025-05-07T19:50:24.0436771Z GPU_SRCS: 2025-05-07T19:50:24.0436975Z 
codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:24.0437146Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:24.0437272Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:24.0437417Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:24.0437598Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:24.0437737Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:24.0437883Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:24.0438056Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:24.0438146Z 2025-05-07T19:50:24.0438246Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0438251Z 2025-05-07T19:50:24.0438337Z 2025-05-07T19:50:24.0438459Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0438467Z 2025-05-07T19:50:24.0438552Z 2025-05-07T19:50:24.0438640Z OTHER_SRCS: 2025-05-07T19:50:24.0438645Z 2025-05-07T19:50:24.0438764Z 2025-05-07T19:50:24.0438860Z CC_FLAGS: 2025-05-07T19:50:24.0438864Z 2025-05-07T19:50:24.0438947Z 2025-05-07T19:50:24.0439041Z NVCC_FLAGS: 2025-05-07T19:50:24.0439174Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0439283Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0439394Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0439531Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0439624Z 2025-05-07T19:50:24.0439719Z HIPCC_FLAGS: 2025-05-07T19:50:24.0439724Z 2025-05-07T19:50:24.0439808Z 2025-05-07T19:50:24.0439926Z INCLUDE_DIRS: 2025-05-07T19:50:24.0440050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0440150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0440290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0440409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0440697Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0441163Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0441314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0441489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0441651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0441877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0442086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0442237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0442563Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0442650Z 2025-05-07T19:50:24.0442758Z Selected Source Files: 2025-05-07T19:50:24.0442977Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:24.0443193Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:24.0443390Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:24.0443541Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:24.0443693Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:24.0443836Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:24.0443990Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:24.0444144Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:24.0444286Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:24.0444428Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:24.0444517Z 2025-05-07T19:50:24.0444628Z HIPified Source Files: 2025-05-07T19:50:24.0444632Z 2025-05-07T19:50:24.0444777Z 2025-05-07T19:50:24.0444885Z Library Dependencies: 2025-05-07T19:50:24.0444990Z torch 2025-05-07T19:50:24.0445079Z torch_library 2025-05-07T19:50:24.0445401Z 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0445683Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0446011Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0446362Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0446800Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0446939Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.0447032Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0447258Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0447369Z 2025-05-07T19:50:24.0447462Z Output Library: 2025-05-07T19:50:24.0447570Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:24.0447659Z 2025-05-07T19:50:24.0447776Z Destination Directory: 2025-05-07T19:50:24.0447870Z fbgemm_gpu 2025-05-07T19:50:24.0447988Z ================================================================================ 2025-05-07T19:50:24.0447993Z 2025-05-07T19:50:24.0447997Z 2025-05-07T19:50:24.0448001Z 2025-05-07T19:50:24.0448150Z ================================================================================ 2025-05-07T19:50:24.0448344Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:24.0448434Z 2025-05-07T19:50:24.0448553Z CPU_SRCS: 2025-05-07T19:50:24.0448741Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:24.0448821Z 2025-05-07T19:50:24.0448904Z GPU_SRCS: 2025-05-07T19:50:24.0449109Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:24.0449264Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:24.0449349Z 2025-05-07T19:50:24.0449475Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0449481Z 2025-05-07T19:50:24.0449565Z 2025-05-07T19:50:24.0449653Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:50:24.0449740Z 2025-05-07T19:50:24.0449854Z 2025-05-07T19:50:24.0449954Z OTHER_SRCS: 2025-05-07T19:50:24.0449958Z 2025-05-07T19:50:24.0450037Z 2025-05-07T19:50:24.0450116Z CC_FLAGS: 2025-05-07T19:50:24.0450120Z 2025-05-07T19:50:24.0450231Z 2025-05-07T19:50:24.0450329Z NVCC_FLAGS: 2025-05-07T19:50:24.0450445Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0450568Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0450670Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0450782Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0450875Z 2025-05-07T19:50:24.0450989Z HIPCC_FLAGS: 2025-05-07T19:50:24.0450993Z 2025-05-07T19:50:24.0451086Z 2025-05-07T19:50:24.0451186Z INCLUDE_DIRS: 2025-05-07T19:50:24.0451313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0451417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0451534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0451649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0451956Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0452351Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0452501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0452693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0452852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0453062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0453291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0453445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0453754Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0453900Z 2025-05-07T19:50:24.0454022Z Selected Source Files: 2025-05-07T19:50:24.0454208Z 
src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:24.0454390Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:24.0454577Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:24.0454667Z 2025-05-07T19:50:24.0454765Z HIPified Source Files: 2025-05-07T19:50:24.0454769Z 2025-05-07T19:50:24.0454875Z 2025-05-07T19:50:24.0454978Z Library Dependencies: 2025-05-07T19:50:24.0455069Z torch 2025-05-07T19:50:24.0455156Z torch_library 2025-05-07T19:50:24.0455480Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0455724Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0456059Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0456438Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0456717Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0456939Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0457049Z 2025-05-07T19:50:24.0457138Z Output Library: 2025-05-07T19:50:24.0457254Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:24.0457346Z 2025-05-07T19:50:24.0457463Z Destination Directory: 2025-05-07T19:50:24.0457548Z fbgemm_gpu 2025-05-07T19:50:24.0457673Z ================================================================================ 2025-05-07T19:50:24.0457677Z 2025-05-07T19:50:24.0457681Z 2025-05-07T19:50:24.0457686Z 2025-05-07T19:50:24.0457816Z ================================================================================ 2025-05-07T19:50:24.0457958Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:24.0458039Z 2025-05-07T19:50:24.0458156Z CPU_SRCS: 2025-05-07T19:50:24.0458379Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:24.0458490Z 
src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:24.0458791Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0459062Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:24.0459247Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0459459Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:24.0459678Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0459906Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:24.0460058Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:24.0460193Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:24.0460337Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:24.0460462Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:24.0460611Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:24.0460721Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:24.0460834Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:24.0460965Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:24.0461102Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:24.0461203Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:24.0461309Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:24.0461406Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:24.0461538Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:24.0461635Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:24.0461742Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:24.0461876Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:24.0462101Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:24.0462254Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:24.0462504Z 
src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:24.0462750Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:24.0462856Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:24.0462963Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:24.0463094Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:24.0463205Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:24.0463390Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:24.0463508Z src/topology_utils.cpp 2025-05-07T19:50:24.0463582Z 2025-05-07T19:50:24.0463660Z GPU_SRCS: 2025-05-07T19:50:24.0463770Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:24.0463906Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:24.0464110Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:24.0464211Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:24.0464338Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:24.0464530Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:24.0464706Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:24.0464844Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:24.0465006Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:24.0465247Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:24.0465428Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:24.0465621Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:24.0465767Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:24.0465917Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:24.0466078Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:24.0466220Z src/jagged_tensor_ops/jagged_softmax_backward.cu 
2025-05-07T19:50:24.0466348Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:24.0466471Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:24.0466656Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:24.0466861Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:24.0466985Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:24.0467162Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:24.0467303Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:24.0467401Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:24.0467635Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:24.0467830Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:24.0468018Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:24.0468114Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:24.0468223Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:24.0468354Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:24.0468481Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:24.0468615Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:24.0468730Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:24.0468854Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:24.0468956Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:24.0469102Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:24.0469242Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:24.0469358Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:24.0469501Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:24.0469646Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:24.0469794Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:24.0469922Z src/sparse_ops/sparse_group_index.cu 
2025-05-07T19:50:24.0470036Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:24.0470149Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:24.0470312Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:24.0470454Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:24.0470583Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:24.0470702Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:24.0470826Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:24.0470924Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:24.0471050Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:24.0471162Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:24.0471292Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:24.0471410Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:24.0471515Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:24.0471616Z 2025-05-07T19:50:24.0471704Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:24.0471708Z 2025-05-07T19:50:24.0471787Z 2025-05-07T19:50:24.0471884Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:24.0471887Z 2025-05-07T19:50:24.0471978Z 2025-05-07T19:50:24.0472059Z OTHER_SRCS: 2025-05-07T19:50:24.0472067Z 2025-05-07T19:50:24.0472149Z 2025-05-07T19:50:24.0472254Z CC_FLAGS: 2025-05-07T19:50:24.0472258Z 2025-05-07T19:50:24.0472339Z 2025-05-07T19:50:24.0472423Z NVCC_FLAGS: 2025-05-07T19:50:24.0472548Z --expt-relaxed-constexpr 2025-05-07T19:50:24.0472639Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:24.0472741Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:24.0472837Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:24.0472938Z 2025-05-07T19:50:24.0473018Z HIPCC_FLAGS: 2025-05-07T19:50:24.0473022Z 2025-05-07T19:50:24.0473103Z 2025-05-07T19:50:24.0473216Z INCLUDE_DIRS: 2025-05-07T19:50:24.0473321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0473415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:24.0473521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:24.0473654Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:24.0473914Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:50:24.0474284Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:24.0474452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:24.0474662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:24.0474816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:24.0475010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:24.0475234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:24.0475370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:24.0475652Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:50:24.0475762Z 2025-05-07T19:50:24.0475853Z Selected Source Files: 2025-05-07T19:50:24.0475955Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:24.0476091Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:24.0476373Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0476743Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:24.0476960Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0477213Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:24.0477440Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:24.0477732Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:24.0477913Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:24.0478058Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:24.0478202Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 
2025-05-07T19:50:24.0478351Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:24.0478508Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:24.0478634Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:24.0478801Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:24.0478953Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:24.0479071Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:24.0479194Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:24.0479308Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:24.0479410Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:24.0479523Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:24.0479632Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:24.0479765Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:24.0479884Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:24.0480131Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:24.0480301Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:24.0480520Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:24.0480767Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:24.0480898Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:24.0481004Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:24.0481119Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:24.0481246Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:24.0481462Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:24.0481562Z src/topology_utils.cpp 2025-05-07T19:50:24.0481688Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:24.0481801Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:24.0482016Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:24.0482125Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:24.0482238Z 
src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:24.0482454Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:24.0482645Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:24.0482786Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:24.0482949Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:24.0483201Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:24.0483436Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:24.0483627Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:24.0483784Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:24.0483939Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:24.0484077Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:24.0484243Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:24.0484383Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:24.0484503Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:24.0484697Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:24.0484867Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:24.0484999Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:24.0485171Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:24.0485303Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:24.0485414Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:24.0485646Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:24.0485862Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:24.0486056Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:24.0486174Z src/quantize_ops/quantize_bfloat16.cu 
2025-05-07T19:50:24.0486309Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:24.0486451Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:24.0486577Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:24.0486721Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:24.0486824Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:24.0487002Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:24.0487112Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:24.0487269Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:24.0487410Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:24.0487539Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:24.0487711Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:24.0487855Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:24.0488004Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:24.0488119Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:24.0488260Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:24.0488361Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:24.0488471Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:24.0488631Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:24.0488765Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:24.0488978Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:24.0489084Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:24.0489219Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:24.0489334Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:24.0489437Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:24.0489582Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:24.0489687Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:24.0489783Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:24.0489870Z 2025-05-07T19:50:24.0489994Z HIPified Source Files: 
2025-05-07T19:50:24.0489998Z 2025-05-07T19:50:24.0490074Z 2025-05-07T19:50:24.0490176Z Library Dependencies: 2025-05-07T19:50:24.0490277Z torch 2025-05-07T19:50:24.0490370Z torch_library 2025-05-07T19:50:24.0490662Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:50:24.0490926Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:24.0491250Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:24.0491571Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:24.0491863Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:24.0491968Z fbgemm 2025-05-07T19:50:24.0492071Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:24.0492168Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:24.0492288Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:24.0492380Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:24.0492469Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:24.0492559Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:24.0492787Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:24.0492870Z 2025-05-07T19:50:24.0492960Z Output Library: 2025-05-07T19:50:24.0493067Z fbgemm_gpu_py 2025-05-07T19:50:24.0493135Z 2025-05-07T19:50:24.0493234Z Destination Directory: 2025-05-07T19:50:24.0493321Z fbgemm_gpu 2025-05-07T19:50:24.0493455Z ================================================================================ 2025-05-07T19:50:24.0493463Z 2025-05-07T19:50:24.0493564Z -- Configuring done (8.5s) 2025-05-07T19:50:24.1639973Z -- Generating done (0.1s) 2025-05-07T19:50:24.1656961Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:50:24.1837588Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build' 
2025-05-07T19:50:24.1837632Z 2025-05-07T19:50:24.1838193Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:24.3435558Z [1/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:24.3587530Z [2/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:24.3818754Z [3/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:24.3839584Z [4/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:24.3859957Z [5/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:24.3915813Z [6/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:24.4019145Z [7/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:24.4058012Z [8/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:24.4077224Z [9/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:24.4176472Z [10/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:24.4471400Z [11/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:24.4551823Z [12/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:24.4597681Z [13/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:24.4629175Z [14/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:24.4660479Z [15/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:24.4722957Z [16/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:24.4733273Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp:10: 2025-05-07T19:50:24.4735035Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.4738175Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4742038Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.4743905Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4745470Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 
'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.4748932Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4752707Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.4754622Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4756130Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.4759728Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4763397Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.4765392Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4766958Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.4770245Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4773800Z 135 | 
ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.4775599Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4777085Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.4780286Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4783915Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.4786215Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4787708Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.4790956Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4794656Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.4796694Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4798233Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool 
asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.4801458Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4805090Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.4807267Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4808822Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.4812086Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4815838Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.4817875Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4819431Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.4822704Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4826421Z 141 | 
ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.4828395Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4829937Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.4833337Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.4837159Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.4839134Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.4839811Z At global scope: 2025-05-07T19:50:24.4841101Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.4884429Z [17/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:24.4993540Z [18/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:24.5023231Z [19/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:24.5240275Z [20/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:24.5265617Z [21/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:24.5530054Z [22/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:24.5567820Z [23/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:24.5578131Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp:10: 2025-05-07T19:50:24.5579859Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.5582983Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise 
operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5586544Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.5588626Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5590030Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.5593200Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5596956Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.5598824Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5600401Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.5603851Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5607810Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 
2025-05-07T19:50:24.5609658Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5611196Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.5614366Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5617894Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.5619534Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5620929Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.5623787Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5627444Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.5629472Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5631014Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.5634233Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration 
types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5637894Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.5639781Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5641376Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.5644813Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5648885Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.5650871Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5652280Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.5655394Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5658717Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.5660822Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5662298Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.5665571Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5669227Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.5671180Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5672670Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.5675988Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5679937Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.5681828Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5682466Z At global scope: 2025-05-07T19:50:24.5683671Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.5720125Z [24/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:24.5833623Z [25/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:24.5853940Z [26/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:24.5864597Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:24.5866074Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp:13: 2025-05-07T19:50:24.5867956Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.5871446Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5875382Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.5877377Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5878844Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.5881755Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5885048Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.5886795Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5888223Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.5891622Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5895513Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 
2025-05-07T19:50:24.5897487Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5899109Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.5902609Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5906409Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.5908272Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5909928Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.5913258Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5917062Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.5919089Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5920699Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.5923924Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: 
bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5927601Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.5929336Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5930712Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.5933778Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5937702Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.5939891Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5941393Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.5944707Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5948524Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == 
(RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.5950317Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5951915Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.5955345Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5959237Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.5961154Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5962534Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.5965669Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.5969323Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.5971200Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.5971784Z At global scope: 2025-05-07T19:50:24.5973024Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 
2025-05-07T19:50:24.5982876Z [27/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:24.5996680Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:24.5997866Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp:11: 2025-05-07T19:50:24.5999559Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.6002489Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6006345Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return 
_signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6008140Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6009486Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.6012936Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6016839Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6018896Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6020516Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.6023959Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6027558Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6029334Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6030799Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() 
const': 2025-05-07T19:50:24.6033874Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6037323Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.6038980Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6040555Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.6043838Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6047691Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6049509Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6050998Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.6054239Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6057964Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool 
isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6059936Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6061448Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.6064404Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6067818Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6069564Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6070936Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.6074105Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6077979Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.6079971Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6081649Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool 
asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.6084793Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6088335Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.6090082Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6091659Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.6094917Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6098602Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.6100648Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6101239Z At global scope: 2025-05-07T19:50:24.6102370Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.6111517Z [28/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:24.6120862Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:24.6122069Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp:13: 2025-05-07T19:50:24.6123679Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.6126612Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6129987Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6131723Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6133128Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.6136280Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6139891Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6141500Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6143046Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.6146252Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6149799Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6151665Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6153218Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.6156422Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation 
between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6159615Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.6161306Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6162697Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.6165890Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6169556Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6171627Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6173212Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.6176459Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6180097Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 
2025-05-07T19:50:24.6182035Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6183599Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.6186702Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6190559Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6192764Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6194210Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.6197212Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6200358Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.6202103Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6203486Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.6206632Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6210460Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.6212481Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6214106Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.6217707Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6221110Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.6222841Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6223407Z At global scope: 2025-05-07T19:50:24.6224644Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.6236098Z [29/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:24.6255238Z [30/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:24.6537743Z [31/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:24.6549335Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/a64archtraits_p.h:13, 2025-05-07T19:50:24.6550692Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp:16: 2025-05-07T19:50:24.6552625Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.6555741Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6559061Z 
132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6561090Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6562709Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.6566417Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6570334Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6572369Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6573993Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.6577272Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6580997Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6582995Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6584588Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.6587850Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6591538Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.6593592Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6595094Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.6598617Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6602291Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6604227Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6605705Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.6608956Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 
'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6612793Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6614835Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6616371Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.6619602Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6623525Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6625589Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6627221Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.6630536Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6633871Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.6635792Z | 
~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6637675Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.6640940Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6644506Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.6646645Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6648233Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.6651498Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6654951Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.6657078Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6657673Z At global scope: 2025-05-07T19:50:24.6658780Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.6668073Z [32/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:24.6686822Z [33/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:24.6860228Z [34/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:24.6894891Z [35/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:24.6905888Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp:12: 2025-05-07T19:50:24.6907759Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.6910978Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6914565Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6916686Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6918272Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.6921390Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise 
operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6925281Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6927275Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6928808Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.6932010Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6935680Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6937589Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6939091Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.6942513Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6946073Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 
2025-05-07T19:50:24.6948170Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6949729Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.6952975Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6956720Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.6958684Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6960172Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.6963325Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6966977Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.6969191Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6970670Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.6973898Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between 
different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6977548Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.6979498Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6981023Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.6984228Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6988090Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.6990022Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.6991622Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.6994856Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.6998749Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 
2025-05-07T19:50:24.7000821Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7002371Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.7005650Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7009359Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.7011365Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7012000Z At global scope: 2025-05-07T19:50:24.7013242Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.7025924Z [36/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:24.7085222Z [37/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:24.7095696Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:24.7097021Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:24.7098162Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp:9: 2025-05-07T19:50:24.7099904Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool 
asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.7103285Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7106988Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.7108875Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7110409Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.7113719Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7117936Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.7119875Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7121456Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.7124771Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7128519Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.7130451Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7131981Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.7135408Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7138912Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.7140694Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7142267Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.7145570Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7149833Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.7151774Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7153320Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.7156714Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7160472Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.7162710Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7164330Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.7167649Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7171337Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.7173276Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7174784Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.7178113Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7182109Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.7184107Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7185694Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.7189045Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7192842Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.7194952Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7196743Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.7200270Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7204044Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementH2); } 2025-05-07T19:50:24.7206034Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7206853Z At global scope: 2025-05-07T19:50:24.7208092Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.7447722Z [38/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:24.7698036Z [39/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:24.7793556Z [40/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:24.7803608Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:24.7804904Z from 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:24.7806005Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp:9: 2025-05-07T19:50:24.7808051Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.7811315Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7814795Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.7816611Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7818163Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.7821317Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7824813Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.7826873Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7828366Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool 
asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.7831485Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7834941Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.7836839Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7838260Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.7841366Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7844640Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.7846311Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7848043Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.7851404Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 
2025-05-07T19:50:24.7854698Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.7856493Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7858083Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.7861264Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7864902Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.7866784Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7868589Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.7871948Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7875627Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.7877552Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7879093Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.7882307Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7885939Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.7887848Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7889410Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.7892700Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7896433Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.7898470Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7900075Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.7903536Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration 
types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.7907041Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.7908994Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.7909616Z At global scope: 2025-05-07T19:50:24.7910735Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.8233223Z [41/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:24.8243755Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 
2025-05-07T19:50:24.8245063Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64emithelper_p.h:13, 2025-05-07T19:50:24.8246197Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp:14: 2025-05-07T19:50:24.8248165Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:24.8251515Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8255517Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.8257458Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8259342Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:24.8262839Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8266617Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.8268567Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8270144Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 
'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:24.8273591Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8277302Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.8279411Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8280957Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:24.8284302Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8287728Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:24.8289600Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8291293Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:24.8294969Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8299011Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:24.8301101Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8302686Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:24.8306203Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8310144Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:24.8312233Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8313922Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:24.8317603Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8321606Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:24.8323668Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
2025-05-07T19:50:24.8325472Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:24.8329074Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8333077Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:24.8335132Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8336801Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:24.8340355Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8344376Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:24.8346573Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8348006Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:24.8351353Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation 
between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:24.8355254Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:24.8357324Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:24.8357956Z At global scope: 2025-05-07T19:50:24.8359109Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:24.8369575Z [42/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:24.8400168Z [43/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:24.8656925Z [44/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:24.9524539Z [45/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:24.9916878Z [46/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:25.0268853Z [47/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:25.0681748Z [48/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:25.1983259Z [49/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:25.2858396Z [50/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:25.2877225Z [51/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:25.2887639Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:25.2888997Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:25.2890138Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp:12: 2025-05-07T19:50:25.2891936Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:25.2895281Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2898994Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.2900992Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2902575Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:25.2906010Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2909744Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.2911687Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2913253Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:25.2916646Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2920372Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.2922298Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2923851Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:25.2927140Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2930707Z 135 | ASMJIT_INLINE_NODEBUG 
constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:25.2932678Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2934243Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:25.2937575Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2941270Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:25.2943245Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2944824Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:25.2948366Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2952059Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:25.2954240Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2955819Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In 
member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:25.2959213Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2962929Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:25.2964888Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2966461Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:25.2969787Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2973494Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:25.2975423Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2977033Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:25.2980622Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is 
deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2984331Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:25.2986310Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2987944Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:25.2991240Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:25.2994954Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:25.2997064Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:25.2997703Z At global scope: 2025-05-07T19:50:25.2999120Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:25.3755292Z [52/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:25.5835470Z [53/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:25.6043702Z [54/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:25.7450153Z [55/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:25.7775911Z [56/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:25.8488363Z [57/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:25.9041177Z [58/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:25.9568008Z [59/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:26.4359251Z [60/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:26.4370183Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:26.4371549Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:26.4372719Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp:18: 2025-05-07T19:50:26.4374614Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:26.4378080Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4382047Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:26.4384052Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4385711Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:26.4389186Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated 
[-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4393272Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:26.4395211Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4396921Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:26.4400321Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4403771Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:26.4405774Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4407326Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:26.4410878Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4414564Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:26.4416408Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4417998Z 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:26.4421477Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4425303Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:26.4427299Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4428972Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:26.4432451Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4436442Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:26.4438771Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4440359Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:26.4443863Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 
'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4448104Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:26.4450124Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4451748Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:26.4455120Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4459264Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:26.4461332Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4463006Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:26.4466528Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4470485Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | 
kSignatureElementB4); } 2025-05-07T19:50:26.4472506Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4474213Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:26.4477901Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:26.4481830Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:26.4483906Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:26.4484858Z At global scope: 2025-05-07T19:50:26.4486089Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:26.8140825Z [61/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:27.2042497Z [62/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:27.6912663Z [63/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:28.2886623Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:28.3905117Z [65/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:30.7860867Z [66/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:32.9785806Z [67/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:33.0019412Z [68/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:33.0183374Z [69/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:33.0343310Z [70/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:33.1257893Z [71/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:34.4338628Z [72/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:34.6677381Z [73/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:35.3188469Z [74/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:35.4645299Z [75/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:37.1675959Z [76/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:37.8721989Z [77/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:38.4135295Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win 
-L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:41.0057008Z [79/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:41.3718381Z [80/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:42.6006872Z [81/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:42.8657080Z [82/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:45.1082934Z [83/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:46.9122798Z [84/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:47.3907308Z [85/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:50.2689447Z [86/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:51.5837329Z [87/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:52.2657704Z [88/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:56.9426860Z [89/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:56.9591810Z [90/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:00.3685927Z [91/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:01.8501539Z [92/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:01.8663939Z [93/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:03.4867824Z [94/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp 
-MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:06.9269838Z [95/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:07.1187563Z [96/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:10.5639116Z [97/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:11.9706430Z [98/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:12.1541654Z [99/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:13.6919048Z [100/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:17.1122344Z [101/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:26.9042992Z [102/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:27.8868299Z [103/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:32.2448236Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:32.5722902Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:33.3820254Z [106/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:33.6937013Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:34.0899072Z [108/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:34.2437351Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:34.3249243Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:34.5524957Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 
2025-05-07T19:51:34.7396675Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:34.9551590Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:35.3409326Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:38.1368987Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:45.4170434Z [116/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:47.5309658Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:48.1277590Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:51:50.9194151Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:51:54.3531412Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:00.4472596Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:00.8851523Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:19.4957453Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:24.8878979Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:28.0557158Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 
2025-05-07T19:52:33.3236080Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:33.7640032Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:34.0123492Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:34.3551227Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:38.4236343Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:46.0082092Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:50.6376683Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:53.2166305Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:52:55.0794306Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:58.8309843Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:04.0590014Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:04.0613982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0616039Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0616706Z ^ 2025-05-07T19:53:04.0617043Z 2025-05-07T19:53:04.0617504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.0618185Z 2025-05-07T19:53:04.0619888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0621956Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0622558Z ^ 2025-05-07T19:53:04.0622873Z 2025-05-07T19:53:04.0624488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0626578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0627155Z ^ 2025-05-07T19:53:04.0627461Z 2025-05-07T19:53:04.0629392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0631278Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0631827Z ^ 2025-05-07T19:53:04.0632122Z 2025-05-07T19:53:04.0632540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.0633171Z 2025-05-07T19:53:04.0634697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0636673Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0637267Z ^ 2025-05-07T19:53:04.0637564Z 2025-05-07T19:53:04.0639151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0641065Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0641647Z ^ 2025-05-07T19:53:04.0641966Z 2025-05-07T19:53:04.0643390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0645309Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0645896Z ^ 2025-05-07T19:53:04.0646199Z 2025-05-07T19:53:04.0646917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.0647598Z 2025-05-07T19:53:04.0649011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0650925Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0651439Z ^ 2025-05-07T19:53:04.0652050Z 2025-05-07T19:53:04.0653583Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0655485Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0656059Z ^ 2025-05-07T19:53:04.0656332Z 2025-05-07T19:53:04.0657714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0659615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0660210Z ^ 2025-05-07T19:53:04.0660508Z 2025-05-07T19:53:04.0660969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.0661662Z 2025-05-07T19:53:04.0663310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0665266Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0665833Z ^ 2025-05-07T19:53:04.0666168Z 2025-05-07T19:53:04.0667775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0669795Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0670361Z ^ 2025-05-07T19:53:04.0670657Z 2025-05-07T19:53:04.0674486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0676750Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0677338Z ^ 
2025-05-07T19:53:04.0677639Z 2025-05-07T19:53:04.0678075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:04.0678764Z 2025-05-07T19:53:04.0680364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0682211Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0682701Z ^ 2025-05-07T19:53:04.0683013Z 2025-05-07T19:53:04.0684548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:04.0686470Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:04.0686977Z ^ 2025-05-07T19:53:04.0687258Z 2025-05-07T19:53:13.0195088Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:13.0217765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0219800Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < 
num_packed_bags) { 2025-05-07T19:53:13.0220498Z ^ 2025-05-07T19:53:13.0220851Z 2025-05-07T19:53:13.0221313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.0221926Z 2025-05-07T19:53:13.0223331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0225085Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0225648Z ^ 2025-05-07T19:53:13.0225929Z 2025-05-07T19:53:13.0227358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0229071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0229619Z ^ 2025-05-07T19:53:13.0229903Z 2025-05-07T19:53:13.0231299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0233045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0233587Z ^ 2025-05-07T19:53:13.0233828Z 2025-05-07T19:53:13.0235243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0237331Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.0238041Z ^ 2025-05-07T19:53:13.0238580Z 2025-05-07T19:53:13.0239001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.0239581Z 2025-05-07T19:53:13.0240915Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0242611Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0243105Z ^ 2025-05-07T19:53:13.0243380Z 2025-05-07T19:53:13.0244726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0246794Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0247275Z ^ 2025-05-07T19:53:13.0247544Z 2025-05-07T19:53:13.0248876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0250593Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0251080Z ^ 2025-05-07T19:53:13.0251322Z 2025-05-07T19:53:13.0252659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0254544Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.0255290Z ^ 2025-05-07T19:53:13.0255543Z 2025-05-07T19:53:13.0256211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.0256806Z 2025-05-07T19:53:13.0258138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0259881Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0260381Z ^ 
2025-05-07T19:53:13.0260657Z 2025-05-07T19:53:13.0261996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0263748Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0264217Z ^ 2025-05-07T19:53:13.0264490Z 2025-05-07T19:53:13.0265897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0267672Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0268182Z ^ 2025-05-07T19:53:13.0268438Z 2025-05-07T19:53:13.0269862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0271779Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.0272484Z ^ 2025-05-07T19:53:13.0272788Z 2025-05-07T19:53:13.0273178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.0273789Z 2025-05-07T19:53:13.0275127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0277257Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0277743Z ^ 2025-05-07T19:53:13.0278029Z 2025-05-07T19:53:13.0279339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0281076Z if (output_d >= 0 && 
output_d < D) { 2025-05-07T19:53:13.0281608Z ^ 2025-05-07T19:53:13.0281886Z 2025-05-07T19:53:13.0283244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0285020Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0285596Z ^ 2025-05-07T19:53:13.0285855Z 2025-05-07T19:53:13.0287190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0289135Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.0289907Z ^ 2025-05-07T19:53:13.0290149Z 2025-05-07T19:53:13.0290549Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.0291155Z 2025-05-07T19:53:13.0292550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0294515Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0295026Z ^ 2025-05-07T19:53:13.0295303Z 2025-05-07T19:53:13.0296694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.0298489Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0298983Z ^ 2025-05-07T19:53:13.0299222Z 2025-05-07T19:53:13.0300633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:13.0302380Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.0302884Z ^ 2025-05-07T19:53:13.0303129Z 2025-05-07T19:53:13.0572058Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:13.7577079Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:13.7597263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7599731Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.7600428Z ^ 2025-05-07T19:53:13.7600692Z 2025-05-07T19:53:13.7601043Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:13.7601581Z 2025-05-07T19:53:13.7602970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7604583Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7605164Z ^ 2025-05-07T19:53:13.7605459Z 2025-05-07T19:53:13.7606675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7608489Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7608992Z ^ 2025-05-07T19:53:13.7609217Z 2025-05-07T19:53:13.7610459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7612114Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7612574Z ^ 2025-05-07T19:53:13.7612841Z 2025-05-07T19:53:13.7614488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7616274Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.7617030Z ^ 2025-05-07T19:53:13.7617313Z 2025-05-07T19:53:13.7617658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.7618164Z 2025-05-07T19:53:13.7619533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7621107Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7621608Z ^ 2025-05-07T19:53:13.7621887Z 2025-05-07T19:53:13.7623182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7624906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7625404Z ^ 2025-05-07T19:53:13.7625625Z 2025-05-07T19:53:13.7626838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7628582Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7629025Z ^ 2025-05-07T19:53:13.7629275Z 2025-05-07T19:53:13.7630500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7632365Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.7632977Z ^ 2025-05-07T19:53:13.7633246Z 2025-05-07T19:53:13.7633632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.7634489Z 2025-05-07T19:53:13.7635690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7637595Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7638043Z ^ 2025-05-07T19:53:13.7638296Z 2025-05-07T19:53:13.7639653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:53:13.7641231Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7641715Z ^ 2025-05-07T19:53:13.7641992Z 2025-05-07T19:53:13.7643291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7645034Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7645482Z ^ 2025-05-07T19:53:13.7645705Z 2025-05-07T19:53:13.7647220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7648999Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.7649654Z ^ 2025-05-07T19:53:13.7649920Z 2025-05-07T19:53:13.7650402Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.7651198Z 2025-05-07T19:53:13.7652439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7654203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7654648Z ^ 2025-05-07T19:53:13.7654909Z 2025-05-07T19:53:13.7656299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7657896Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7658352Z ^ 2025-05-07T19:53:13.7658629Z 2025-05-07T19:53:13.7659978Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7661625Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7662177Z ^ 2025-05-07T19:53:13.7662397Z 2025-05-07T19:53:13.7663641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7665482Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:13.7666107Z ^ 2025-05-07T19:53:13.7666349Z 2025-05-07T19:53:13.7666756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:13.7667385Z 2025-05-07T19:53:13.7668651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7670406Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7671149Z ^ 2025-05-07T19:53:13.7671420Z 2025-05-07T19:53:13.7672721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7674450Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7674890Z ^ 2025-05-07T19:53:13.7675131Z 2025-05-07T19:53:13.7676686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:13.7678220Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:13.7678742Z ^ 
2025-05-07T19:53:13.7679017Z 2025-05-07T19:53:16.2145074Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:16.8768612Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:17.4591129Z [142/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:20.3763311Z [143/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:22.6628600Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:33.5672835Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:37.1859742Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:40.7388362Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:44.0484625Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:46.2472330Z [149/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:03.7027277Z [150/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:09.2067142Z [151/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:09.3345327Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:09.7671522Z [153/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:10.0484476Z [154/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:12.9761564Z [155/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:16.0849084Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:20.3207841Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:20.3230692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3232651Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3233903Z ^ 2025-05-07T19:54:20.3237277Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:20.3240398Z 2025-05-07T19:54:20.3240836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.3241456Z 2025-05-07T19:54:20.3242695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3244613Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3245528Z ^ 
2025-05-07T19:54:20.3249420Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:20.3252822Z 2025-05-07T19:54:20.3254559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3256506Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3257370Z ^ 2025-05-07T19:54:20.3260789Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:20.3263754Z 2025-05-07T19:54:20.3264957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3266814Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3267660Z ^ 2025-05-07T19:54:20.3270975Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:20.3274281Z 2025-05-07T19:54:20.3275586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3277692Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3278943Z ^ 2025-05-07T19:54:20.3282384Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:20.3285566Z 2025-05-07T19:54:20.3286874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3288739Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3289497Z ^ 2025-05-07T19:54:20.3292299Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:20.3295246Z 2025-05-07T19:54:20.3296509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:54:20.3298313Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3299188Z ^ 2025-05-07T19:54:20.3302124Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:20.3305367Z 2025-05-07T19:54:20.3306772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3308766Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3309690Z ^ 2025-05-07T19:54:20.3313197Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:20.3316681Z 2025-05-07T19:54:20.3318000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3319892Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3320721Z ^ 2025-05-07T19:54:20.3324233Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:20.3327177Z 2025-05-07T19:54:20.3328424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3330398Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3331273Z ^ 2025-05-07T19:54:20.3334704Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:20.3337817Z 2025-05-07T19:54:20.3339115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3341490Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3342381Z ^ 2025-05-07T19:54:20.3346011Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 
2025-05-07T19:54:20.3349212Z 2025-05-07T19:54:20.3350426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3352331Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3353112Z ^ 2025-05-07T19:54:20.3356644Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:20.3359744Z 2025-05-07T19:54:20.3361016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3363030Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3363955Z ^ 2025-05-07T19:54:20.3367950Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:20.3371218Z 2025-05-07T19:54:20.3372388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3374294Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:54:20.3375238Z ^ 2025-05-07T19:54:20.3378620Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:20.3381706Z 2025-05-07T19:54:20.3382915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3384831Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3385918Z ^ 2025-05-07T19:54:20.3388840Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:20.3391756Z 2025-05-07T19:54:20.3392911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3394644Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3395445Z ^ 2025-05-07T19:54:20.3398814Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, 
const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:20.3401768Z 2025-05-07T19:54:20.3402899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3404833Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3405671Z ^ 2025-05-07T19:54:20.3408793Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:20.3411956Z 2025-05-07T19:54:20.3413196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3415293Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3416154Z ^ 2025-05-07T19:54:20.3419422Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:20.3422390Z 2025-05-07T19:54:20.3423537Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3425361Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3426164Z ^ 2025-05-07T19:54:20.3429838Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:20.3433041Z 2025-05-07T19:54:20.3434244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3436375Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3437262Z ^ 2025-05-07T19:54:20.3440343Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:20.3443662Z 2025-05-07T19:54:20.3444828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3446945Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3447832Z ^ 
2025-05-07T19:54:20.3451333Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:20.3455100Z 2025-05-07T19:54:20.3456365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3458345Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3459126Z ^ 2025-05-07T19:54:20.3462246Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:20.3465221Z 2025-05-07T19:54:20.3466419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3468230Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3469129Z ^ 2025-05-07T19:54:20.3472906Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:20.3476105Z 2025-05-07T19:54:20.3477349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3478961Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3479725Z ^ 2025-05-07T19:54:20.3482911Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:20.3486170Z 2025-05-07T19:54:20.3487419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3489365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3490259Z ^ 2025-05-07T19:54:20.3493673Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:20.3497119Z 2025-05-07T19:54:20.3497568Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.3498257Z 
2025-05-07T19:54:20.3499487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3501609Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3502490Z ^ 2025-05-07T19:54:20.3506018Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:20.3509256Z 2025-05-07T19:54:20.3510589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3512546Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3513346Z ^ 2025-05-07T19:54:20.3517238Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:20.3520512Z 2025-05-07T19:54:20.3521852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3523823Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:20.3524744Z ^ 2025-05-07T19:54:20.3528270Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:20.3531469Z 2025-05-07T19:54:20.3532773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3534678Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3535584Z ^ 2025-05-07T19:54:20.3538983Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:20.3542478Z 2025-05-07T19:54:20.3543756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3545753Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3546973Z ^ 2025-05-07T19:54:20.3550456Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:20.3553557Z 2025-05-07T19:54:20.3554742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3556806Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3557904Z ^ 2025-05-07T19:54:20.3561362Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:20.3564331Z 2025-05-07T19:54:20.3565581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3567363Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3568196Z ^ 2025-05-07T19:54:20.3571483Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:20.3574358Z 2025-05-07T19:54:20.3575499Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3577254Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3578042Z ^ 2025-05-07T19:54:20.3581245Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:20.3584567Z 2025-05-07T19:54:20.3585698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3587656Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3588555Z ^ 2025-05-07T19:54:20.3592078Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:20.3595558Z 2025-05-07T19:54:20.3596968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3598895Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3599808Z ^ 
2025-05-07T19:54:20.3603421Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:20.3606341Z 2025-05-07T19:54:20.3607561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3609434Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3610341Z ^ 2025-05-07T19:54:20.3613204Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:20.3616025Z 2025-05-07T19:54:20.3617210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3619172Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3620054Z ^ 2025-05-07T19:54:20.3623516Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:20.3626960Z 2025-05-07T19:54:20.3628248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3629924Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3630737Z ^ 2025-05-07T19:54:20.3633948Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:20.3636982Z 2025-05-07T19:54:20.3638103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3639921Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3640814Z ^ 2025-05-07T19:54:20.3647870Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:20.3651259Z 2025-05-07T19:54:20.3652595Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3654395Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3655355Z ^ 2025-05-07T19:54:20.3658377Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:20.3661316Z 2025-05-07T19:54:20.3662523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3664434Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3665275Z ^ 2025-05-07T19:54:20.3668663Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:20.3672159Z 2025-05-07T19:54:20.3673372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3675341Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3676366Z ^ 
2025-05-07T19:54:20.3679792Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:20.3682902Z 2025-05-07T19:54:20.3684032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3686052Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3686919Z ^ 2025-05-07T19:54:20.3690612Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:20.3693815Z 2025-05-07T19:54:20.3694940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3696881Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3697839Z ^ 2025-05-07T19:54:20.3701263Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:20.3704606Z 2025-05-07T19:54:20.3705962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3707901Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3708775Z ^ 2025-05-07T19:54:20.3712226Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:20.3715731Z 2025-05-07T19:54:20.3717215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3719256Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3720189Z ^ 2025-05-07T19:54:20.3723847Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:20.3727397Z 2025-05-07T19:54:20.3728687Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3730653Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3731517Z ^ 2025-05-07T19:54:20.3735024Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:20.3738098Z 2025-05-07T19:54:20.3739337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3741294Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3742210Z ^ 2025-05-07T19:54:20.3745716Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:20.3749217Z 2025-05-07T19:54:20.3750472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3752384Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:20.3753255Z ^ 2025-05-07T19:54:20.3756987Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:20.3760424Z 2025-05-07T19:54:20.3760890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.3761576Z 2025-05-07T19:54:20.3762945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3765003Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3765945Z ^ 2025-05-07T19:54:20.3769548Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:20.3772883Z 2025-05-07T19:54:20.3774056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3776071Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3777005Z ^ 2025-05-07T19:54:20.3780991Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const 
int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:20.3784265Z 2025-05-07T19:54:20.3785759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3787722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3788651Z ^ 2025-05-07T19:54:20.3792203Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:20.3795478Z 2025-05-07T19:54:20.3796877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3798886Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3799807Z ^ 2025-05-07T19:54:20.3803453Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 
2025-05-07T19:54:20.3807118Z 2025-05-07T19:54:20.3808447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3810512Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3811449Z ^ 2025-05-07T19:54:20.3815093Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:20.3818415Z 2025-05-07T19:54:20.3819630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3821623Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3822532Z ^ 2025-05-07T19:54:20.3826201Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:20.3829501Z 2025-05-07T19:54:20.3830789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3832791Z reinterpret_cast(&lxu_cache_weights[cache_idx * 
max_D_cache]); 2025-05-07T19:54:20.3833690Z ^ 2025-05-07T19:54:20.3837257Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:20.3840537Z 2025-05-07T19:54:20.3841822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3843783Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3844684Z ^ 2025-05-07T19:54:20.3848409Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:20.3851656Z 2025-05-07T19:54:20.3853252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3854959Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3855788Z ^ 2025-05-07T19:54:20.3859104Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const 
float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:20.3862202Z 2025-05-07T19:54:20.3863441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3865427Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3866169Z ^ 2025-05-07T19:54:20.3869823Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:20.3873033Z 2025-05-07T19:54:20.3874340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3876414Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3877311Z ^ 2025-05-07T19:54:20.3880802Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:20.3883795Z 2025-05-07T19:54:20.3884958Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3886670Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3887488Z ^ 2025-05-07T19:54:20.3890698Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:20.3893702Z 2025-05-07T19:54:20.3895151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3897477Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3898565Z ^ 2025-05-07T19:54:20.3902113Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:20.3905451Z 2025-05-07T19:54:20.3906543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3908435Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3909311Z ^ 
2025-05-07T19:54:20.3912762Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:20.3916662Z 2025-05-07T19:54:20.3917956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3919970Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3920850Z ^ 2025-05-07T19:54:20.3924282Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:20.3926982Z 2025-05-07T19:54:20.3928137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3930029Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3930757Z ^ 2025-05-07T19:54:20.3934015Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:20.3936882Z 2025-05-07T19:54:20.3938163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3940220Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3941071Z ^ 2025-05-07T19:54:20.3944570Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:20.3948496Z 2025-05-07T19:54:20.3949843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3952135Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3953067Z ^ 2025-05-07T19:54:20.3956493Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:20.3959569Z 2025-05-07T19:54:20.3961034Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3962769Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3963655Z ^ 2025-05-07T19:54:20.3967070Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:20.3970253Z 2025-05-07T19:54:20.3971487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3973421Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3974343Z ^ 2025-05-07T19:54:20.3977891Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:20.3981089Z 2025-05-07T19:54:20.3982215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3984410Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:20.3985345Z ^ 2025-05-07T19:54:20.3988857Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:20.3992159Z 2025-05-07T19:54:20.3993397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.3995326Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.3996401Z ^ 2025-05-07T19:54:20.4000034Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:20.4003537Z 2025-05-07T19:54:20.4005000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4006996Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4007876Z ^ 2025-05-07T19:54:20.4011317Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const 
float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:20.4014613Z 2025-05-07T19:54:20.4015807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4017661Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4018548Z ^ 2025-05-07T19:54:20.4021552Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:20.4024289Z 2025-05-07T19:54:20.4024693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:20.4025277Z 2025-05-07T19:54:20.4026386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4028322Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4029138Z ^ 2025-05-07T19:54:20.4032247Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 
2025-05-07T19:54:20.4035693Z 2025-05-07T19:54:20.4037006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4038794Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4039635Z ^ 2025-05-07T19:54:20.4042719Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:20.4045749Z 2025-05-07T19:54:20.4047463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4049605Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4050531Z ^ 2025-05-07T19:54:20.4053634Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:20.4056594Z 2025-05-07T19:54:20.4057794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4059574Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4060408Z ^ 2025-05-07T19:54:20.4063752Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:20.4066942Z 2025-05-07T19:54:20.4068208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4070416Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4071317Z ^ 2025-05-07T19:54:20.4074633Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:20.4077825Z 2025-05-07T19:54:20.4079082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4080983Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4081820Z ^ 2025-05-07T19:54:20.4085370Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:20.4088674Z 2025-05-07T19:54:20.4090129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4092121Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4093037Z ^ 2025-05-07T19:54:20.4096108Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:20.4099100Z 2025-05-07T19:54:20.4100327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4102152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4103013Z ^ 2025-05-07T19:54:20.4106466Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:20.4109526Z 2025-05-07T19:54:20.4110799Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4112570Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4113829Z ^ 2025-05-07T19:54:20.4117120Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:20.4120364Z 2025-05-07T19:54:20.4121643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4123718Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4124626Z ^ 2025-05-07T19:54:20.4128218Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:20.4131441Z 2025-05-07T19:54:20.4132859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4134825Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4135744Z ^ 
2025-05-07T19:54:20.4139133Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:20.4142362Z 2025-05-07T19:54:20.4143595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4145565Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4146719Z ^ 2025-05-07T19:54:20.4149856Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:20.4153026Z 2025-05-07T19:54:20.4154327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4156432Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4157699Z ^ 2025-05-07T19:54:20.4161124Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:20.4164407Z 2025-05-07T19:54:20.4165678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4167686Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4168573Z ^ 2025-05-07T19:54:20.4171755Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:20.4175083Z 2025-05-07T19:54:20.4176406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4178778Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4179665Z ^ 2025-05-07T19:54:20.4183245Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:20.4186566Z 2025-05-07T19:54:20.4187797Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4189532Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4190417Z ^ 2025-05-07T19:54:20.4193820Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:20.4197111Z 2025-05-07T19:54:20.4198273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4200056Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4200929Z ^ 2025-05-07T19:54:20.4204429Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:20.4207393Z 2025-05-07T19:54:20.4208531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4210258Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4211063Z ^ 
2025-05-07T19:54:20.4214323Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:20.4217261Z 2025-05-07T19:54:20.4218394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4220334Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4221213Z ^ 2025-05-07T19:54:20.4224807Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:20.4228207Z 2025-05-07T19:54:20.4229463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4231470Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4232378Z ^ 2025-05-07T19:54:20.4235827Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, 
const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:20.4239073Z 2025-05-07T19:54:20.4240189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4242069Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4242911Z ^ 2025-05-07T19:54:20.4246199Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:20.4249181Z 2025-05-07T19:54:20.4250319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4252162Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4253058Z ^ 2025-05-07T19:54:20.4256588Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:20.4259869Z 2025-05-07T19:54:20.4261164Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4263141Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4264242Z ^ 2025-05-07T19:54:20.4267282Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:20.4270303Z 2025-05-07T19:54:20.4271448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4273188Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4274038Z ^ 2025-05-07T19:54:20.4277561Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:20.4280486Z 2025-05-07T19:54:20.4280935Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:20.4281578Z 2025-05-07T19:54:20.4282827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign
2025-05-07T19:54:20.4284961Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4286068Z ^ 2025-05-07T19:54:20.4289580Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:20.4292786Z 2025-05-07T19:54:20.4293957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4295752Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4296610Z ^ 2025-05-07T19:54:20.4299982Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:20.4303119Z 2025-05-07T19:54:20.4304352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4306468Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4307331Z ^ 2025-05-07T19:54:20.4310764Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:20.4313773Z 2025-05-07T19:54:20.4315013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4317022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4317795Z ^ 2025-05-07T19:54:20.4321053Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:20.4324083Z 2025-05-07T19:54:20.4325302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4327206Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4328082Z ^ 2025-05-07T19:54:20.4331669Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:20.4334673Z 
2025-05-07T19:54:20.4335966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4337901Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4338860Z ^ 2025-05-07T19:54:20.4342185Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:20.4345430Z 2025-05-07T19:54:20.4346980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4349177Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4350440Z ^ 2025-05-07T19:54:20.4354163Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:20.4357504Z 2025-05-07T19:54:20.4358827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4360762Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:20.4361693Z ^ 2025-05-07T19:54:20.4365157Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:20.4368222Z 2025-05-07T19:54:20.4369406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4371203Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4372083Z ^ 2025-05-07T19:54:20.4375325Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:20.4378847Z 2025-05-07T19:54:20.4380107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4382022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4382916Z ^ 2025-05-07T19:54:20.4386373Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:20.4389536Z 2025-05-07T19:54:20.4390801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4392759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4393774Z ^ 2025-05-07T19:54:20.4397525Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:20.4400993Z 2025-05-07T19:54:20.4402313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4404507Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4405400Z ^ 2025-05-07T19:54:20.4408819Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:20.4412076Z 2025-05-07T19:54:20.4413346Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4415359Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4416269Z ^ 2025-05-07T19:54:20.4419830Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:20.4423326Z 2025-05-07T19:54:20.4424608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4426592Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4427524Z ^ 2025-05-07T19:54:20.4431086Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:20.4434488Z 2025-05-07T19:54:20.4435835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4438180Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4439133Z ^ 
2025-05-07T19:54:20.4442937Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:20.4446740Z 2025-05-07T19:54:20.4448168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4450139Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4451071Z ^ 2025-05-07T19:54:20.4454388Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:20.4457681Z 2025-05-07T19:54:20.4458975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4460954Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4461892Z ^ 2025-05-07T19:54:20.4465490Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:20.4469104Z 2025-05-07T19:54:20.4470393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4472287Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4473208Z ^ 2025-05-07T19:54:20.4476875Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:20.4498940Z 2025-05-07T19:54:20.4500127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4501849Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4502697Z ^ 2025-05-07T19:54:20.4506400Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:20.4509668Z 2025-05-07T19:54:20.4510960Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4512847Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4513759Z ^ 2025-05-07T19:54:20.4517371Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:20.4520347Z 2025-05-07T19:54:20.4521419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4523234Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4524102Z ^ 2025-05-07T19:54:20.4527345Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:20.4530939Z 2025-05-07T19:54:20.4532225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4534190Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4535102Z ^ 
2025-05-07T19:54:20.4538641Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:20.4541630Z 2025-05-07T19:54:20.4542845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:20.4544700Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:20.4545601Z ^ 2025-05-07T19:54:20.4549786Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:20.4553085Z 2025-05-07T19:54:23.5867830Z [158/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:25.8463640Z [159/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:25.8653300Z [160/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:28.9933734Z [161/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:29.2705982Z [162/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:29.5725959Z [163/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:31.7001249Z [164/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:31.7140176Z [165/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7281170Z [166/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7419416Z [167/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 
2025-05-07T19:54:31.7561710Z [168/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7695133Z [169/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7834772Z [170/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.7973227Z [171/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8113398Z [172/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8248755Z [173/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8387316Z [174/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8525784Z [175/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8664239Z [176/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:31.8800223Z [177/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:32.0230380Z [178/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:33.2123969Z [179/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:35.2251570Z [180/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:36.9674645Z [181/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:37.3397843Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:38.0434752Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:38.7068899Z [184/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:41.4512636Z [185/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:41.4940223Z [186/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:41.8263798Z [187/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:41.8288263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8290295Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8291115Z ^ 2025-05-07T19:54:41.8291456Z 2025-05-07T19:54:41.8291918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.8292585Z 
2025-05-07T19:54:41.8294260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8296183Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8296776Z ^ 2025-05-07T19:54:41.8297082Z 2025-05-07T19:54:41.8298748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8300789Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8301351Z ^ 2025-05-07T19:54:41.8301650Z 2025-05-07T19:54:41.8303290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8305332Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8305925Z ^ 2025-05-07T19:54:41.8306226Z 2025-05-07T19:54:41.8306686Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.8307405Z 2025-05-07T19:54:41.8309122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8310995Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8311559Z ^ 2025-05-07T19:54:41.8311898Z 2025-05-07T19:54:41.8313536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8315741Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:54:41.8316408Z ^ 2025-05-07T19:54:41.8316715Z 2025-05-07T19:54:41.8318373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8320370Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8320956Z ^ 2025-05-07T19:54:41.8321259Z 2025-05-07T19:54:41.8321711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.8322419Z 2025-05-07T19:54:41.8324236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8326322Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8326850Z ^ 2025-05-07T19:54:41.8327147Z 2025-05-07T19:54:41.8328741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8330807Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8331377Z ^ 2025-05-07T19:54:41.8331659Z 2025-05-07T19:54:41.8333497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8335563Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8336145Z ^ 2025-05-07T19:54:41.8336447Z 2025-05-07T19:54:41.8336923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.8337604Z 2025-05-07T19:54:41.8339128Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8341217Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8341792Z ^ 2025-05-07T19:54:41.8342122Z 2025-05-07T19:54:41.8343767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8345725Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8346302Z ^ 2025-05-07T19:54:41.8346895Z 2025-05-07T19:54:41.8348534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8350560Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8351131Z ^ 2025-05-07T19:54:41.8351433Z 2025-05-07T19:54:41.8351908Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:41.8352581Z 2025-05-07T19:54:41.8353999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8356153Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8356744Z ^ 2025-05-07T19:54:41.8357324Z 2025-05-07T19:54:41.8358961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:41.8360922Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:41.8361483Z ^ 
2025-05-07T19:54:41.8361812Z 2025-05-07T19:54:41.8414155Z [188/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.8589208Z [189/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.8741813Z [190/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.8916124Z [191/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.9081468Z [192/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:41.9215193Z [193/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:43.4448888Z [194/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:43.9804946Z [195/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:44.6354621Z [196/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:44.9206942Z [197/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:47.7155698Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:49.9973819Z [199/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.4774461Z [200/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:54.6804638Z [201/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:55.5929803Z [202/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.6301003Z [203/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:57.3250266Z [204/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.1235345Z [205/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.1740384Z [206/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.3447488Z [207/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:58.4864789Z [208/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:54:59.3876808Z [209/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:59.3899425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3901284Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:54:59.3901972Z ^ 2025-05-07T19:54:59.3902226Z 
2025-05-07T19:54:59.3902645Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:59.3903288Z 2025-05-07T19:54:59.3904687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3906619Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3907144Z ^ 2025-05-07T19:54:59.3907474Z 2025-05-07T19:54:59.3908915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3910768Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3911241Z ^ 2025-05-07T19:54:59.3911484Z 2025-05-07T19:54:59.3912864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3914563Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3915032Z ^ 2025-05-07T19:54:59.3915249Z 2025-05-07T19:54:59.3917010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3918890Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:54:59.3919556Z ^ 2025-05-07T19:54:59.3919811Z 2025-05-07T19:54:59.3920237Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:59.3920880Z 2025-05-07T19:54:59.3922251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless 
comparison of unsigned integer with zero 2025-05-07T19:54:59.3924176Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3924680Z ^ 2025-05-07T19:54:59.3924963Z 2025-05-07T19:54:59.3926423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3928312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3928832Z ^ 2025-05-07T19:54:59.3929089Z 2025-05-07T19:54:59.3930633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3932552Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3933124Z ^ 2025-05-07T19:54:59.3933381Z 2025-05-07T19:54:59.3934874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3936987Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:54:59.3937988Z ^ 2025-05-07T19:54:59.3938277Z 2025-05-07T19:54:59.3938725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:59.3939415Z 2025-05-07T19:54:59.3940874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3942772Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3943327Z ^ 2025-05-07T19:54:59.3943603Z 2025-05-07T19:54:59.3945002Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3947327Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3947856Z ^ 2025-05-07T19:54:59.3948141Z 2025-05-07T19:54:59.3949595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3951390Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3951925Z ^ 2025-05-07T19:54:59.3952193Z 2025-05-07T19:54:59.3953596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3955724Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:54:59.3956576Z ^ 2025-05-07T19:54:59.3957210Z 2025-05-07T19:54:59.3957647Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:59.3958270Z 2025-05-07T19:54:59.3959678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3961367Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3961912Z ^ 2025-05-07T19:54:59.3962181Z 2025-05-07T19:54:59.3963623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3965421Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3965978Z ^ 
2025-05-07T19:54:59.3966267Z 2025-05-07T19:54:59.3967712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3969558Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3970114Z ^ 2025-05-07T19:54:59.3970385Z 2025-05-07T19:54:59.3971816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3973787Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:54:59.3974514Z ^ 2025-05-07T19:54:59.3974785Z 2025-05-07T19:54:59.3975215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:59.3975853Z 2025-05-07T19:54:59.3977332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3979501Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3980031Z ^ 2025-05-07T19:54:59.3980319Z 2025-05-07T19:54:59.3981725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3983547Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:59.3984057Z ^ 2025-05-07T19:54:59.3984320Z 2025-05-07T19:54:59.3985738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:59.3987612Z if (output_d >= 0 && 
output_d < D) { 2025-05-07T19:54:59.3988149Z ^ 2025-05-07T19:54:59.3988419Z 2025-05-07T19:55:00.8831568Z [210/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.7882580Z [211/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:01.9746934Z [212/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:01.9770279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9772521Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9773059Z ^ 2025-05-07T19:55:01.9773349Z 2025-05-07T19:55:01.9773796Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9774433Z 2025-05-07T19:55:01.9775975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9777947Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:55:01.9778531Z ^ 2025-05-07T19:55:01.9778829Z 2025-05-07T19:55:01.9780413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9782425Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9782988Z ^ 2025-05-07T19:55:01.9783271Z 2025-05-07T19:55:01.9784807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9786930Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9787472Z ^ 2025-05-07T19:55:01.9787806Z 2025-05-07T19:55:01.9788239Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9788892Z 2025-05-07T19:55:01.9793717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9795865Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9796528Z ^ 2025-05-07T19:55:01.9796818Z 2025-05-07T19:55:01.9798320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9800244Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9800805Z ^ 2025-05-07T19:55:01.9801102Z 2025-05-07T19:55:01.9802621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:01.9804568Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9805145Z ^ 2025-05-07T19:55:01.9805464Z 2025-05-07T19:55:01.9805892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9806555Z 2025-05-07T19:55:01.9808046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9810020Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9810561Z ^ 2025-05-07T19:55:01.9810866Z 2025-05-07T19:55:01.9812378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9814339Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9814878Z ^ 2025-05-07T19:55:01.9815174Z 2025-05-07T19:55:01.9816935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9818832Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9819406Z ^ 2025-05-07T19:55:01.9819699Z 2025-05-07T19:55:01.9820157Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9820769Z 2025-05-07T19:55:01.9822295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9824218Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9824761Z ^ 2025-05-07T19:55:01.9825074Z 2025-05-07T19:55:01.9826597Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9828473Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9828974Z ^ 2025-05-07T19:55:01.9829278Z 2025-05-07T19:55:01.9830721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9832537Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9833099Z ^ 2025-05-07T19:55:01.9833394Z 2025-05-07T19:55:01.9834050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:01.9834694Z 2025-05-07T19:55:01.9836283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9838193Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9838732Z ^ 2025-05-07T19:55:01.9839050Z 2025-05-07T19:55:01.9840564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:01.9842533Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:01.9843090Z ^ 2025-05-07T19:55:01.9843406Z 2025-05-07T19:55:03.5272925Z [213/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:03.5295007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5296924Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.5297673Z ^ 2025-05-07T19:55:03.5297961Z 2025-05-07T19:55:03.5298732Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.5299370Z 2025-05-07T19:55:03.5300832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5302600Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5303123Z ^ 2025-05-07T19:55:03.5303384Z 2025-05-07T19:55:03.5304842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5306567Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5307045Z ^ 2025-05-07T19:55:03.5307292Z 2025-05-07T19:55:03.5308801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5310457Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5310960Z ^ 2025-05-07T19:55:03.5311207Z 2025-05-07T19:55:03.5312551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5314343Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.5315029Z ^ 2025-05-07T19:55:03.5315289Z 2025-05-07T19:55:03.5315676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.5316407Z 2025-05-07T19:55:03.5317821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5319875Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5320358Z ^ 2025-05-07T19:55:03.5320652Z 2025-05-07T19:55:03.5321996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5323770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5324258Z ^ 2025-05-07T19:55:03.5324514Z 2025-05-07T19:55:03.5325956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5327703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5328214Z ^ 2025-05-07T19:55:03.5328459Z 2025-05-07T19:55:03.5329839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:55:03.5331759Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.5332478Z ^ 2025-05-07T19:55:03.5332740Z 2025-05-07T19:55:03.5333144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.5333782Z 2025-05-07T19:55:03.5335173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5337186Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5337731Z ^ 2025-05-07T19:55:03.5338032Z 2025-05-07T19:55:03.5339500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5341391Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5341953Z ^ 2025-05-07T19:55:03.5342240Z 2025-05-07T19:55:03.5343818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5345759Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5346317Z ^ 2025-05-07T19:55:03.5347084Z 2025-05-07T19:55:03.5348698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5350863Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.5351618Z ^ 2025-05-07T19:55:03.5351897Z 2025-05-07T19:55:03.5352315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.5352969Z 
2025-05-07T19:55:03.5354494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5356548Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5357087Z ^ 2025-05-07T19:55:03.5357388Z 2025-05-07T19:55:03.5358928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5361120Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5361602Z ^ 2025-05-07T19:55:03.5361858Z 2025-05-07T19:55:03.5363422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5365393Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5365976Z ^ 2025-05-07T19:55:03.5366267Z 2025-05-07T19:55:03.5367827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5370043Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:03.5370818Z ^ 2025-05-07T19:55:03.5371120Z 2025-05-07T19:55:03.5371568Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:03.5372263Z 2025-05-07T19:55:03.5373823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5375819Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:55:03.5376371Z ^ 2025-05-07T19:55:03.5376658Z 2025-05-07T19:55:03.5378240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5380533Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5381100Z ^ 2025-05-07T19:55:03.5381379Z 2025-05-07T19:55:03.5382976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.5384977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:03.5385549Z ^ 2025-05-07T19:55:03.5385829Z 2025-05-07T19:55:03.9780414Z [214/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:04.0715531Z [215/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:04.6180837Z [216/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:05.1318566Z [217/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:06.3133597Z [218/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:06.4311739Z [219/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:07.4474998Z [220/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:08.5189602Z [221/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:09.5218282Z [222/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:09.6537227Z [223/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:11.7928102Z [224/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:12.1590168Z [225/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:12.2930484Z [226/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:12.5073312Z [227/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:12.5695528Z [228/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:13.2336781Z [229/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:13.2717934Z [230/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:13.9519141Z [231/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:55:15.0455846Z [232/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:15.2561774Z [233/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:16.0528082Z [234/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:17.1573267Z [235/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:17.5472478Z [236/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:17.5840156Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:17.5863981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5866163Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5866961Z ^ 2025-05-07T19:55:17.5867259Z 2025-05-07T19:55:17.5867719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5868416Z 2025-05-07T19:55:17.5870017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5872021Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5872617Z ^ 
2025-05-07T19:55:17.5872933Z 2025-05-07T19:55:17.5874500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5876588Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5877176Z ^ 2025-05-07T19:55:17.5877456Z 2025-05-07T19:55:17.5878939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5880866Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5881815Z ^ 2025-05-07T19:55:17.5882098Z 2025-05-07T19:55:17.5883699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5885585Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5886297Z ^ 2025-05-07T19:55:17.5886588Z 2025-05-07T19:55:17.5887059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5887619Z 2025-05-07T19:55:17.5888996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5890805Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5891316Z ^ 2025-05-07T19:55:17.5891631Z 2025-05-07T19:55:17.5893018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5894847Z if (output_d >= 0 && 
output_d < D) { 2025-05-07T19:55:17.5895344Z ^ 2025-05-07T19:55:17.5895627Z 2025-05-07T19:55:17.5897045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5899031Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5899541Z ^ 2025-05-07T19:55:17.5899799Z 2025-05-07T19:55:17.5901216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5903180Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5904164Z ^ 2025-05-07T19:55:17.5904419Z 2025-05-07T19:55:17.5904824Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5905405Z 2025-05-07T19:55:17.5906757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5908699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5909252Z ^ 2025-05-07T19:55:17.5909556Z 2025-05-07T19:55:17.5911107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5913076Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5913630Z ^ 2025-05-07T19:55:17.5913931Z 2025-05-07T19:55:17.5915343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:17.5917274Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5917775Z ^ 2025-05-07T19:55:17.5918029Z 2025-05-07T19:55:17.5919430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5921362Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5922232Z ^ 2025-05-07T19:55:17.5922533Z 2025-05-07T19:55:17.5923005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5923605Z 2025-05-07T19:55:17.5925156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5927159Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5927725Z ^ 2025-05-07T19:55:17.5928039Z 2025-05-07T19:55:17.5929593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5931539Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5932048Z ^ 2025-05-07T19:55:17.5932352Z 2025-05-07T19:55:17.5933864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5935738Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5936314Z ^ 2025-05-07T19:55:17.5936587Z 2025-05-07T19:55:17.5938150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: 
pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5940269Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5941066Z ^ 2025-05-07T19:55:17.5941364Z 2025-05-07T19:55:17.5941816Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5942512Z 2025-05-07T19:55:17.5944082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5946205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5946993Z ^ 2025-05-07T19:55:17.5947313Z 2025-05-07T19:55:17.5948847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5950835Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5951396Z ^ 2025-05-07T19:55:17.5951677Z 2025-05-07T19:55:17.5953270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5955239Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5955816Z ^ 2025-05-07T19:55:17.5956188Z 2025-05-07T19:55:19.0348700Z [238/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:19.4865332Z [239/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:26.0064153Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:26.0087703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0090157Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.0090903Z ^ 2025-05-07T19:55:26.0091215Z 2025-05-07T19:55:26.0091641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.0092301Z 2025-05-07T19:55:26.0093827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0095682Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0096248Z ^ 2025-05-07T19:55:26.0096536Z 2025-05-07T19:55:26.0097826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0099535Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0100047Z ^ 2025-05-07T19:55:26.0100298Z 2025-05-07T19:55:26.0101729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0103546Z if 
(output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0104101Z ^ 2025-05-07T19:55:26.0104379Z 2025-05-07T19:55:26.0106033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0108304Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.0109081Z ^ 2025-05-07T19:55:26.0109365Z 2025-05-07T19:55:26.0109839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.0110521Z 2025-05-07T19:55:26.0112005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0113899Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0114428Z ^ 2025-05-07T19:55:26.0114726Z 2025-05-07T19:55:26.0116489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0118508Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0119104Z ^ 2025-05-07T19:55:26.0119351Z 2025-05-07T19:55:26.0120695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0122336Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0122876Z ^ 2025-05-07T19:55:26.0123117Z 2025-05-07T19:55:26.0124234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:55:26.0126036Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.0126704Z ^ 2025-05-07T19:55:26.0126954Z 2025-05-07T19:55:26.0127405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.0128093Z 2025-05-07T19:55:26.0129432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0131517Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0132051Z ^ 2025-05-07T19:55:26.0132337Z 2025-05-07T19:55:26.0133822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0135714Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0136268Z ^ 2025-05-07T19:55:26.0136544Z 2025-05-07T19:55:26.0138150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0140112Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0140680Z ^ 2025-05-07T19:55:26.0140956Z 2025-05-07T19:55:26.0142505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0144670Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.0145438Z ^ 2025-05-07T19:55:26.0145717Z 2025-05-07T19:55:26.0146166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.0147124Z 
2025-05-07T19:55:26.0148959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0150943Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0151492Z ^ 2025-05-07T19:55:26.0151776Z 2025-05-07T19:55:26.0153312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0155231Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0155796Z ^ 2025-05-07T19:55:26.0156150Z 2025-05-07T19:55:26.0157686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0159572Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0160136Z ^ 2025-05-07T19:55:26.0160411Z 2025-05-07T19:55:26.0161852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0163947Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.0164714Z ^ 2025-05-07T19:55:26.0164994Z 2025-05-07T19:55:26.0165428Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.0166080Z 2025-05-07T19:55:26.0167569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0169368Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:55:26.0169936Z ^ 2025-05-07T19:55:26.0170200Z 2025-05-07T19:55:26.0171698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0173494Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0173968Z ^ 2025-05-07T19:55:26.0174201Z 2025-05-07T19:55:26.0175636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.0177564Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.0178120Z ^ 2025-05-07T19:55:26.0178394Z 2025-05-07T19:55:35.6122000Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:35.6146979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6149078Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6149680Z ^ 2025-05-07T19:55:35.6150032Z 2025-05-07T19:55:35.6150530Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:35.6151212Z 2025-05-07T19:55:35.6152849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6155297Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6155901Z ^ 2025-05-07T19:55:35.6156328Z 2025-05-07T19:55:35.6157980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6160066Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6160628Z ^ 2025-05-07T19:55:35.6160894Z 2025-05-07T19:55:35.6162205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6163994Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6164468Z ^ 2025-05-07T19:55:35.6164761Z 2025-05-07T19:55:35.6165124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:35.6165678Z 2025-05-07T19:55:35.6167202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6169183Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6169783Z ^ 2025-05-07T19:55:35.6170087Z 2025-05-07T19:55:35.6171956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with 
zero 2025-05-07T19:55:35.6173964Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6174544Z ^ 2025-05-07T19:55:35.6174832Z 2025-05-07T19:55:35.6176434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6178413Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6178950Z ^ 2025-05-07T19:55:35.6179236Z 2025-05-07T19:55:35.6179672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:35.6180308Z 2025-05-07T19:55:35.6181859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6183839Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6184401Z ^ 2025-05-07T19:55:35.6184684Z 2025-05-07T19:55:35.6186245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6188207Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6188759Z ^ 2025-05-07T19:55:35.6189067Z 2025-05-07T19:55:35.6190609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6192548Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6193080Z ^ 2025-05-07T19:55:35.6193385Z 2025-05-07T19:55:35.6193804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:35.6194651Z 2025-05-07T19:55:35.6196340Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6198251Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6198824Z ^ 2025-05-07T19:55:35.6199094Z 2025-05-07T19:55:35.6200583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6202484Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6203054Z ^ 2025-05-07T19:55:35.6203336Z 2025-05-07T19:55:35.6204841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6206782Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6207304Z ^ 2025-05-07T19:55:35.6207604Z 2025-05-07T19:55:35.6208032Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:35.6208666Z 2025-05-07T19:55:35.6210208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6211961Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6212466Z ^ 2025-05-07T19:55:35.6212947Z 2025-05-07T19:55:35.6214521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:35.6216493Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:35.6217051Z ^ 
2025-05-07T19:55:35.6217348Z 2025-05-07T19:55:41.5193076Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:52.3533970Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:55:59.5898862Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:59.5921969Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5923996Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5924881Z ^ 2025-05-07T19:55:59.5928167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:59.5931188Z 2025-05-07T19:55:59.5931658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.5932320Z 2025-05-07T19:55:59.5933557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5935427Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5936290Z ^ 2025-05-07T19:55:59.5939542Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:59.5942624Z 2025-05-07T19:55:59.5943898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5946022Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5947193Z ^ 2025-05-07T19:55:59.5950473Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:59.5953540Z 2025-05-07T19:55:59.5954888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5957094Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5958029Z ^ 2025-05-07T19:55:59.5961397Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:59.5964466Z 2025-05-07T19:55:59.5966142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5968149Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5969064Z ^ 2025-05-07T19:55:59.5972367Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const 
index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:59.5975527Z 2025-05-07T19:55:59.5976832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5978734Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5979573Z ^ 2025-05-07T19:55:59.5983008Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:59.5985798Z 2025-05-07T19:55:59.5987029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5989190Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.5990084Z ^ 2025-05-07T19:55:59.5993321Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:59.5996441Z 2025-05-07T19:55:59.5997658Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.5999634Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6000471Z ^ 2025-05-07T19:55:59.6003654Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:59.6006546Z 2025-05-07T19:55:59.6008070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6010164Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6011087Z ^ 2025-05-07T19:55:59.6014572Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:59.6017814Z 2025-05-07T19:55:59.6019156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6021093Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6021984Z ^ 2025-05-07T19:55:59.6025142Z detected 
during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:59.6028215Z 2025-05-07T19:55:59.6029510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6031521Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6032440Z ^ 2025-05-07T19:55:59.6055339Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:59.6057920Z 2025-05-07T19:55:59.6059075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6060823Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6061676Z ^ 2025-05-07T19:55:59.6064752Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:59.6067640Z 2025-05-07T19:55:59.6068892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6071152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6072108Z ^ 2025-05-07T19:55:59.6075590Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:59.6079088Z 2025-05-07T19:55:59.6080419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6082484Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6083419Z ^ 2025-05-07T19:55:59.6086897Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:59.6090149Z 2025-05-07T19:55:59.6091477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:59.6093540Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6094357Z ^ 2025-05-07T19:55:59.6097696Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:59.6101151Z 2025-05-07T19:55:59.6102431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6104448Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6105355Z ^ 2025-05-07T19:55:59.6108761Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:59.6111795Z 2025-05-07T19:55:59.6112970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6114765Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6115801Z ^ 2025-05-07T19:55:59.6118934Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:59.6121998Z 2025-05-07T19:55:59.6123343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6125385Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6126336Z ^ 2025-05-07T19:55:59.6129874Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:59.6133120Z 2025-05-07T19:55:59.6134468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6136506Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6137449Z ^ 2025-05-07T19:55:59.6140830Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:59.6144022Z 2025-05-07T19:55:59.6145368Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6147543Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6148463Z ^ 2025-05-07T19:55:59.6151954Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:59.6155174Z 2025-05-07T19:55:59.6156571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6158583Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6159505Z ^ 2025-05-07T19:55:59.6163012Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:59.6165694Z 2025-05-07T19:55:59.6166888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6168786Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6169704Z ^ 
2025-05-07T19:55:59.6173068Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:59.6176340Z 2025-05-07T19:55:59.6177664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6179487Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6180313Z ^ 2025-05-07T19:55:59.6183455Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:59.6186993Z 2025-05-07T19:55:59.6188306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6190339Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6191258Z ^ 2025-05-07T19:55:59.6194772Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:59.6198164Z 2025-05-07T19:55:59.6199468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6201480Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6202397Z ^ 2025-05-07T19:55:59.6205968Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:59.6209082Z 2025-05-07T19:55:59.6209577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.6210255Z 2025-05-07T19:55:59.6211547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6213543Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6214421Z ^ 2025-05-07T19:55:59.6217656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:59.6220619Z 2025-05-07T19:55:59.6221598Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6223476Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6224361Z ^ 2025-05-07T19:55:59.6227755Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:59.6231127Z 2025-05-07T19:55:59.6232400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6234385Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6235277Z ^ 2025-05-07T19:55:59.6238779Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:59.6241896Z 2025-05-07T19:55:59.6243186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6245133Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6246000Z ^ 2025-05-07T19:55:59.6249554Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:59.6252697Z 2025-05-07T19:55:59.6254033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6256019Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6256939Z ^ 2025-05-07T19:55:59.6260379Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:59.6263543Z 2025-05-07T19:55:59.6264879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6266887Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6267816Z ^ 2025-05-07T19:55:59.6271260Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:59.6274630Z 2025-05-07T19:55:59.6276056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6278067Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6278994Z ^ 2025-05-07T19:55:59.6282391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:59.6285368Z 2025-05-07T19:55:59.6286642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6288607Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6289495Z ^ 2025-05-07T19:55:59.6292736Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:59.6295905Z 2025-05-07T19:55:59.6297141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:59.6298941Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6299845Z ^ 2025-05-07T19:55:59.6303279Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:59.6306299Z 2025-05-07T19:55:59.6307585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6309615Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6310515Z ^ 2025-05-07T19:55:59.6313390Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:59.6316075Z 2025-05-07T19:55:59.6317048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6318503Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6319190Z ^ 2025-05-07T19:55:59.6321782Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:59.6324122Z 2025-05-07T19:55:59.6325142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6326700Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6327439Z ^ 2025-05-07T19:55:59.6330007Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:59.6332520Z 2025-05-07T19:55:59.6333510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6335120Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6335985Z ^ 2025-05-07T19:55:59.6339001Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:59.6341505Z 2025-05-07T19:55:59.6342574Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6344176Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6344945Z ^ 2025-05-07T19:55:59.6348053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:59.6350619Z 2025-05-07T19:55:59.6351636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6353506Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6354274Z ^ 2025-05-07T19:55:59.6357083Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:59.6359624Z 2025-05-07T19:55:59.6360648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6362247Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6363003Z ^ 
2025-05-07T19:55:59.6365671Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:59.6368156Z 2025-05-07T19:55:59.6369445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6371043Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6371761Z ^ 2025-05-07T19:55:59.6374467Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:59.6377023Z 2025-05-07T19:55:59.6378056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6379659Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6380403Z ^ 2025-05-07T19:55:59.6383113Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:59.6385650Z 2025-05-07T19:55:59.6386673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6388434Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6389206Z ^ 2025-05-07T19:55:59.6391890Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:59.6394547Z 2025-05-07T19:55:59.6395641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6397532Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6398365Z ^ 2025-05-07T19:55:59.6401497Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:59.6404497Z 2025-05-07T19:55:59.6405910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6407693Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6408588Z ^ 2025-05-07T19:55:59.6411684Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:59.6414570Z 2025-05-07T19:55:59.6415688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6417613Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6418534Z ^ 2025-05-07T19:55:59.6421938Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:59.6424825Z 2025-05-07T19:55:59.6426164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6428134Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6429088Z ^ 2025-05-07T19:55:59.6431942Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, 
const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:59.6435148Z 2025-05-07T19:55:59.6436593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6438579Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6439511Z ^ 2025-05-07T19:55:59.6442850Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:59.6446025Z 2025-05-07T19:55:59.6446745Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.6447435Z 2025-05-07T19:55:59.6449015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6450998Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6451905Z ^ 2025-05-07T19:55:59.6455280Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:59.6458358Z 2025-05-07T19:55:59.6459534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6461324Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6462248Z ^ 2025-05-07T19:55:59.6465601Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:59.6468672Z 2025-05-07T19:55:59.6470059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6472123Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6473315Z ^ 2025-05-07T19:55:59.6476856Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:59.6479975Z 2025-05-07T19:55:59.6481338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:59.6483238Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6484144Z ^ 2025-05-07T19:55:59.6487491Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:59.6490588Z 2025-05-07T19:55:59.6492069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6494021Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6494962Z ^ 2025-05-07T19:55:59.6498391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:59.6501518Z 2025-05-07T19:55:59.6502830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6504885Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6505739Z ^ 2025-05-07T19:55:59.6509031Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:59.6512221Z 2025-05-07T19:55:59.6513558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6515605Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6516654Z ^ 2025-05-07T19:55:59.6520221Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:59.6523223Z 2025-05-07T19:55:59.6524549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6526527Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6527434Z ^ 2025-05-07T19:55:59.6530928Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:59.6534158Z 2025-05-07T19:55:59.6535385Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6537571Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6538500Z ^ 2025-05-07T19:55:59.6541832Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:59.6544908Z 2025-05-07T19:55:59.6546223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6548380Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6549324Z ^ 2025-05-07T19:55:59.6552690Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:59.6555779Z 2025-05-07T19:55:59.6557237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6559223Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6560107Z ^ 
2025-05-07T19:55:59.6563326Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:59.6566817Z 2025-05-07T19:55:59.6568125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6570135Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6571077Z ^ 2025-05-07T19:55:59.6574278Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:59.6577488Z 2025-05-07T19:55:59.6578800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6580817Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6584185Z ^ 2025-05-07T19:55:59.6587632Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:59.6590864Z 2025-05-07T19:55:59.6592194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6594244Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6595142Z ^ 2025-05-07T19:55:59.6598698Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:59.6601869Z 2025-05-07T19:55:59.6603184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6605117Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6606007Z ^ 2025-05-07T19:55:59.6609209Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:59.6612424Z 2025-05-07T19:55:59.6613753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:55:59.6615610Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6616465Z ^ 2025-05-07T19:55:59.6619804Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:59.6622787Z 2025-05-07T19:55:59.6624040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6625995Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6626932Z ^ 2025-05-07T19:55:59.6630552Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:59.6633792Z 2025-05-07T19:55:59.6635114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6637247Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6638158Z ^ 2025-05-07T19:55:59.6641545Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, 
const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:59.6644777Z 2025-05-07T19:55:59.6646109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6648315Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6649161Z ^ 2025-05-07T19:55:59.6652704Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:59.6656198Z 2025-05-07T19:55:59.6657523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6659458Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6660383Z ^ 2025-05-07T19:55:59.6663692Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:59.6666842Z 
2025-05-07T19:55:59.6668085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6669999Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6670877Z ^ 2025-05-07T19:55:59.6674378Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:59.6677488Z 2025-05-07T19:55:59.6678789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6680772Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6681672Z ^ 2025-05-07T19:55:59.6685134Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:59.6688404Z 2025-05-07T19:55:59.6689699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6691711Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:55:59.6692601Z ^ 2025-05-07T19:55:59.6695840Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:59.6699191Z 2025-05-07T19:55:59.6700392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6702268Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6703094Z ^ 2025-05-07T19:55:59.6706455Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:59.6709516Z 2025-05-07T19:55:59.6709967Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:59.6710550Z 2025-05-07T19:55:59.6711783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6713661Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6714548Z ^ 2025-05-07T19:55:59.6718268Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, 
uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:59.6721446Z 2025-05-07T19:55:59.6722747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6724722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6725549Z ^ 2025-05-07T19:55:59.6728775Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:59.6731980Z 2025-05-07T19:55:59.6733270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6735211Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6736094Z ^ 2025-05-07T19:55:59.6739445Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:59.6742852Z 2025-05-07T19:55:59.6744174Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6746069Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6747188Z ^ 2025-05-07T19:55:59.6750560Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:59.6753631Z 2025-05-07T19:55:59.6754941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6757026Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6757860Z ^ 2025-05-07T19:55:59.6761357Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:59.6764438Z 2025-05-07T19:55:59.6765728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6767647Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6768508Z ^ 2025-05-07T19:55:59.6771803Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:59.6775014Z 2025-05-07T19:55:59.6776340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6778306Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6779171Z ^ 2025-05-07T19:55:59.6782501Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:59.6785723Z 2025-05-07T19:55:59.6786988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6788825Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6789657Z ^ 2025-05-07T19:55:59.6792943Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:59.6796296Z 2025-05-07T19:55:59.6797608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6799613Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6800507Z ^ 2025-05-07T19:55:59.6804084Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:59.6807195Z 2025-05-07T19:55:59.6808518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6810510Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6811400Z ^ 2025-05-07T19:55:59.6814784Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:59.6817914Z 2025-05-07T19:55:59.6819167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:59.6821113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6821975Z ^ 2025-05-07T19:55:59.6825420Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:59.6828697Z 2025-05-07T19:55:59.6830013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6832285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6833203Z ^ 2025-05-07T19:55:59.6836772Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:59.6839985Z 2025-05-07T19:55:59.6841303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6843207Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6844017Z ^ 2025-05-07T19:55:59.6847469Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:59.6850997Z 2025-05-07T19:55:59.6852364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6854254Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6855038Z ^ 2025-05-07T19:55:59.6858167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:59.6861295Z 2025-05-07T19:55:59.6862580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6864571Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6865484Z ^ 2025-05-07T19:55:59.6868905Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:59.6871945Z 2025-05-07T19:55:59.6873245Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6875523Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6876608Z ^ 2025-05-07T19:55:59.6880010Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:59.6883239Z 2025-05-07T19:55:59.6884525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6886527Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6887454Z ^ 2025-05-07T19:55:59.6890929Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:59.6894180Z 2025-05-07T19:55:59.6895668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6897662Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6898466Z ^ 
2025-05-07T19:55:59.6901852Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:59.6905124Z 2025-05-07T19:55:59.6906458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6908457Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6909353Z ^ 2025-05-07T19:55:59.6912881Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:59.6916286Z 2025-05-07T19:55:59.6917511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6919453Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6920560Z ^ 2025-05-07T19:55:59.6923957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:59.6927120Z 2025-05-07T19:55:59.6928441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6930439Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6931322Z ^ 2025-05-07T19:55:59.6934802Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:59.6938083Z 2025-05-07T19:55:59.6939622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6941649Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6942580Z ^ 2025-05-07T19:55:59.6946106Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:55:59.6949634Z 2025-05-07T19:55:59.6950990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): 
warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6953013Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6953937Z ^ 2025-05-07T19:55:59.6957578Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:59.6960676Z 2025-05-07T19:55:59.6961917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6963766Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6964960Z ^ 2025-05-07T19:55:59.6968414Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:55:59.6971635Z 2025-05-07T19:55:59.6972093Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:55:59.6972787Z 2025-05-07T19:55:59.6974118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6976050Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6976785Z ^ 2025-05-07T19:55:59.6979634Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:55:59.6982396Z 2025-05-07T19:55:59.6983941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6985894Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6986791Z ^ 2025-05-07T19:55:59.6990088Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:55:59.6993151Z 2025-05-07T19:55:59.6994430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.6996547Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.6997397Z ^ 2025-05-07T19:55:59.7000718Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:55:59.7003867Z 2025-05-07T19:55:59.7005117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7006979Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7008057Z ^ 2025-05-07T19:55:59.7011336Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:55:59.7014256Z 2025-05-07T19:55:59.7015474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7017360Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7018221Z ^ 2025-05-07T19:55:59.7021546Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:55:59.7024493Z 2025-05-07T19:55:59.7025657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:59.7027790Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7028685Z ^ 2025-05-07T19:55:59.7032002Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:55:59.7035215Z 2025-05-07T19:55:59.7036628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7038569Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7039446Z ^ 2025-05-07T19:55:59.7042758Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:55:59.7045994Z 2025-05-07T19:55:59.7047486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7049409Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7050269Z ^ 2025-05-07T19:55:59.7053682Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:55:59.7056916Z 2025-05-07T19:55:59.7058144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7060152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7061020Z ^ 2025-05-07T19:55:59.7064046Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:55:59.7066943Z 2025-05-07T19:55:59.7068146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7070049Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7071193Z ^ 2025-05-07T19:55:59.7074350Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:55:59.7077539Z 2025-05-07T19:55:59.7078783Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7080606Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7081451Z ^ 2025-05-07T19:55:59.7084541Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:55:59.7087780Z 2025-05-07T19:55:59.7089184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7091267Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7092219Z ^ 2025-05-07T19:55:59.7095832Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:55:59.7099208Z 2025-05-07T19:55:59.7100453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7102279Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7103166Z ^ 
2025-05-07T19:55:59.7106387Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:55:59.7109593Z 2025-05-07T19:55:59.7110944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7112953Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7113894Z ^ 2025-05-07T19:55:59.7117721Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:55:59.7120847Z 2025-05-07T19:55:59.7122139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7124169Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7125089Z ^ 2025-05-07T19:55:59.7128510Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:55:59.7131487Z 2025-05-07T19:55:59.7132801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7134674Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7135449Z ^ 2025-05-07T19:55:59.7138209Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:55:59.7141149Z 2025-05-07T19:55:59.7142331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7144081Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7144965Z ^ 2025-05-07T19:55:59.7148374Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:55:59.7151263Z 2025-05-07T19:55:59.7152606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:55:59.7154654Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7155568Z ^ 2025-05-07T19:55:59.7159565Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:55:59.7162878Z 2025-05-07T19:55:59.7164194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7166223Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7167139Z ^ 2025-05-07T19:55:59.7170702Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:55:59.7174003Z 2025-05-07T19:55:59.7175216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7177136Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7178066Z ^ 2025-05-07T19:55:59.7181528Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:55:59.7184959Z 2025-05-07T19:55:59.7186287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7188262Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7189181Z ^ 2025-05-07T19:55:59.7192524Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:55:59.7195422Z 2025-05-07T19:55:59.7196844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7198624Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7199435Z ^ 2025-05-07T19:55:59.7202845Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 
2025-05-07T19:55:59.7206132Z 2025-05-07T19:55:59.7207481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:59.7209514Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:55:59.7210447Z ^ 2025-05-07T19:55:59.7213981Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:55:59.7217305Z 2025-05-07T19:56:07.5464020Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:07.5916488Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:10.0637267Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:10.0662241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0664321Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0664843Z ^ 2025-05-07T19:56:10.0665161Z 2025-05-07T19:56:10.0665618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.0666232Z 2025-05-07T19:56:10.0667753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0669628Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0670186Z ^ 2025-05-07T19:56:10.0670483Z 2025-05-07T19:56:10.0672011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:10.0674326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0674867Z ^ 2025-05-07T19:56:10.0675382Z 2025-05-07T19:56:10.0676980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0678930Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0679489Z ^ 2025-05-07T19:56:10.0679779Z 2025-05-07T19:56:10.0680232Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.0680871Z 2025-05-07T19:56:10.0682433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0684410Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0685003Z ^ 2025-05-07T19:56:10.0685305Z 2025-05-07T19:56:10.0686889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0688714Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0689233Z ^ 2025-05-07T19:56:10.0689526Z 2025-05-07T19:56:10.0690958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0693077Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0693604Z ^ 2025-05-07T19:56:10.0693922Z 2025-05-07T19:56:10.0694335Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.0694945Z 2025-05-07T19:56:10.0696429Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0698293Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0698844Z ^ 2025-05-07T19:56:10.0699124Z 2025-05-07T19:56:10.0700588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0702469Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0703021Z ^ 2025-05-07T19:56:10.0703300Z 2025-05-07T19:56:10.0704758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0706641Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0707162Z ^ 2025-05-07T19:56:10.0707463Z 2025-05-07T19:56:10.0707864Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.0708482Z 2025-05-07T19:56:10.0709963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0711799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0712331Z ^ 2025-05-07T19:56:10.0712590Z 2025-05-07T19:56:10.0714126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0716321Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0716834Z ^ 
2025-05-07T19:56:10.0717095Z 2025-05-07T19:56:10.0718519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0720174Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0720658Z ^ 2025-05-07T19:56:10.0720889Z 2025-05-07T19:56:10.0721233Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.0721758Z 2025-05-07T19:56:10.0722994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0724856Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0725410Z ^ 2025-05-07T19:56:10.0725686Z 2025-05-07T19:56:10.0727135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:10.0729086Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:10.0729672Z ^ 2025-05-07T19:56:10.0729962Z 2025-05-07T19:56:24.0704225Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:24.0723812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0725751Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.0726397Z ^ 2025-05-07T19:56:24.0726641Z 2025-05-07T19:56:24.0727072Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.0727615Z 2025-05-07T19:56:24.0728879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0730557Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0731054Z ^ 2025-05-07T19:56:24.0731333Z 2025-05-07T19:56:24.0732677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0734326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0734784Z ^ 2025-05-07T19:56:24.0735053Z 2025-05-07T19:56:24.0736520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0738127Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0738628Z ^ 2025-05-07T19:56:24.0738871Z 2025-05-07T19:56:24.0740071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless 
comparison of unsigned integer with zero 2025-05-07T19:56:24.0741740Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.0742357Z ^ 2025-05-07T19:56:24.0742600Z 2025-05-07T19:56:24.0742963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.0743531Z 2025-05-07T19:56:24.0744758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0746374Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0747123Z ^ 2025-05-07T19:56:24.0747382Z 2025-05-07T19:56:24.0748569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0750162Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0750615Z ^ 2025-05-07T19:56:24.0750847Z 2025-05-07T19:56:24.0752076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0753666Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0754173Z ^ 2025-05-07T19:56:24.0754427Z 2025-05-07T19:56:24.0755687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0757725Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.0760843Z ^ 2025-05-07T19:56:24.0761088Z 2025-05-07T19:56:24.0761463Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.0762050Z 
2025-05-07T19:56:24.0763323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0764906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0765403Z ^ 2025-05-07T19:56:24.0765673Z 2025-05-07T19:56:24.0766882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0768439Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0768889Z ^ 2025-05-07T19:56:24.0769119Z 2025-05-07T19:56:24.0770336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0771833Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0772313Z ^ 2025-05-07T19:56:24.0772547Z 2025-05-07T19:56:24.0774039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0775746Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.0776407Z ^ 2025-05-07T19:56:24.0776654Z 2025-05-07T19:56:24.0777034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.0777587Z 2025-05-07T19:56:24.0778798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0780367Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:56:24.0780819Z ^ 2025-05-07T19:56:24.0781088Z 2025-05-07T19:56:24.0782295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0783813Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0784247Z ^ 2025-05-07T19:56:24.0784473Z 2025-05-07T19:56:24.0785700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0787230Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0787711Z ^ 2025-05-07T19:56:24.0787939Z 2025-05-07T19:56:24.0789169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0790873Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:24.0791547Z ^ 2025-05-07T19:56:24.0791795Z 2025-05-07T19:56:24.0792178Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:24.0792914Z 2025-05-07T19:56:24.0794129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0795758Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0796366Z ^ 2025-05-07T19:56:24.0796598Z 2025-05-07T19:56:24.0797767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:24.0799309Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0799809Z ^ 2025-05-07T19:56:24.0800038Z 2025-05-07T19:56:24.0801305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:24.0802778Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:24.0803244Z ^ 2025-05-07T19:56:24.0803473Z 2025-05-07T19:56:26.1659641Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:26.1681886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1684274Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.1685180Z ^ 2025-05-07T19:56:26.1685472Z 2025-05-07T19:56:26.1685911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.1686583Z 2025-05-07T19:56:26.1688275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:56:26.1690175Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1690788Z ^ 2025-05-07T19:56:26.1691070Z 2025-05-07T19:56:26.1692540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1694550Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1695105Z ^ 2025-05-07T19:56:26.1695383Z 2025-05-07T19:56:26.1696801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1698702Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1699246Z ^ 2025-05-07T19:56:26.1699493Z 2025-05-07T19:56:26.1701160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1703320Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.1704105Z ^ 2025-05-07T19:56:26.1704360Z 2025-05-07T19:56:26.1704772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.1705379Z 2025-05-07T19:56:26.1706727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1708611Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1709206Z ^ 2025-05-07T19:56:26.1709491Z 2025-05-07T19:56:26.1711026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1712871Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1713421Z ^ 2025-05-07T19:56:26.1713702Z 2025-05-07T19:56:26.1715291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1717400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1717985Z ^ 2025-05-07T19:56:26.1718276Z 2025-05-07T19:56:26.1719746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1721795Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.1722518Z ^ 2025-05-07T19:56:26.1722837Z 2025-05-07T19:56:26.1723272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.1724063Z 2025-05-07T19:56:26.1725577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1727529Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1728066Z ^ 2025-05-07T19:56:26.1728355Z 2025-05-07T19:56:26.1729879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1731850Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1732416Z ^ 2025-05-07T19:56:26.1732705Z 2025-05-07T19:56:26.1734263Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1736290Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1736878Z ^ 2025-05-07T19:56:26.1737161Z 2025-05-07T19:56:26.1738732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1740861Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.1741592Z ^ 2025-05-07T19:56:26.1741893Z 2025-05-07T19:56:26.1742293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.1743079Z 2025-05-07T19:56:26.1744506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1746132Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1746896Z ^ 2025-05-07T19:56:26.1747182Z 2025-05-07T19:56:26.1748455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1750213Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1750739Z ^ 2025-05-07T19:56:26.1751026Z 2025-05-07T19:56:26.1752451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1754184Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1754746Z ^ 
2025-05-07T19:56:26.1755035Z 2025-05-07T19:56:26.1756642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1758718Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:26.1759447Z ^ 2025-05-07T19:56:26.1759749Z 2025-05-07T19:56:26.1760203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:26.1760795Z 2025-05-07T19:56:26.1762245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1764206Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1765025Z ^ 2025-05-07T19:56:26.1765293Z 2025-05-07T19:56:26.1766626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1768664Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1769189Z ^ 2025-05-07T19:56:26.1769497Z 2025-05-07T19:56:26.1770932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:26.1772647Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:26.1773142Z ^ 2025-05-07T19:56:26.1773374Z 2025-05-07T19:56:27.1920872Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:27.1933073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1934188Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:27.1934644Z ^ 2025-05-07T19:56:27.1934813Z 2025-05-07T19:56:27.1935083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:27.1935519Z 2025-05-07T19:56:27.1936313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1937415Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1937736Z ^ 2025-05-07T19:56:27.1937924Z 2025-05-07T19:56:27.1938710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1939737Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1940054Z ^ 2025-05-07T19:56:27.1940237Z 2025-05-07T19:56:27.1941029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1942024Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T19:56:27.1942362Z ^ 2025-05-07T19:56:27.1942514Z 2025-05-07T19:56:27.1943317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1944406Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:27.1944850Z ^ 2025-05-07T19:56:27.1945010Z 2025-05-07T19:56:27.1945276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:27.1945704Z 2025-05-07T19:56:27.1947338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1948399Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1948718Z ^ 2025-05-07T19:56:27.1948904Z 2025-05-07T19:56:27.1949685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1950703Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1951012Z ^ 2025-05-07T19:56:27.1951169Z 2025-05-07T19:56:27.1951975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1952970Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1953302Z ^ 2025-05-07T19:56:27.1953459Z 2025-05-07T19:56:27.1954261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:27.1955355Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:27.1955791Z ^ 2025-05-07T19:56:27.1956041Z 2025-05-07T19:56:27.1956289Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:27.1956673Z 2025-05-07T19:56:27.1957458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1958491Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1958931Z ^ 2025-05-07T19:56:27.1959121Z 2025-05-07T19:56:27.1959897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1961164Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1961479Z ^ 2025-05-07T19:56:27.1961640Z 2025-05-07T19:56:27.1962445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1963448Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1963779Z ^ 2025-05-07T19:56:27.1963940Z 2025-05-07T19:56:27.1964748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1965839Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:27.1966278Z ^ 2025-05-07T19:56:27.1966441Z 2025-05-07T19:56:27.1966686Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:27.1967059Z 2025-05-07T19:56:27.1967838Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1968852Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1969174Z ^ 2025-05-07T19:56:27.1969360Z 2025-05-07T19:56:27.1970259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1971296Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1971613Z ^ 2025-05-07T19:56:27.1971778Z 2025-05-07T19:56:27.1972585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1973579Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1973910Z ^ 2025-05-07T19:56:27.1974070Z 2025-05-07T19:56:27.1974884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1975965Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:27.1976402Z ^ 2025-05-07T19:56:27.1976564Z 2025-05-07T19:56:27.1976808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:27.1977181Z 2025-05-07T19:56:27.1977961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1978983Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1979293Z ^ 
2025-05-07T19:56:27.1979477Z 2025-05-07T19:56:27.1980260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1981255Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1981586Z ^ 2025-05-07T19:56:27.1981789Z 2025-05-07T19:56:27.1982599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.1983635Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:27.1983966Z ^ 2025-05-07T19:56:27.1984127Z 2025-05-07T19:56:30.8878484Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:30.8900425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8902540Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.8903237Z ^ 2025-05-07T19:56:30.8903532Z 2025-05-07T19:56:30.8903957Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:30.8904562Z 2025-05-07T19:56:30.8906025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8907812Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8908544Z ^ 2025-05-07T19:56:30.8908814Z 2025-05-07T19:56:30.8910209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8912140Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8931362Z ^ 2025-05-07T19:56:30.8931716Z 2025-05-07T19:56:30.8933011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8934699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8935222Z ^ 2025-05-07T19:56:30.8935563Z 2025-05-07T19:56:30.8936978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8939002Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.8939718Z ^ 2025-05-07T19:56:30.8940011Z 2025-05-07T19:56:30.8940427Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.8941042Z 2025-05-07T19:56:30.8942503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8944307Z if (output_d >= 
0 && output_d < D) { 2025-05-07T19:56:30.8944841Z ^ 2025-05-07T19:56:30.8945105Z 2025-05-07T19:56:30.8947067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8948906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8949454Z ^ 2025-05-07T19:56:30.8949718Z 2025-05-07T19:56:30.8951104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8952911Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8953413Z ^ 2025-05-07T19:56:30.8953704Z 2025-05-07T19:56:30.8955109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8957180Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.8957879Z ^ 2025-05-07T19:56:30.8958168Z 2025-05-07T19:56:30.8958575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.8959184Z 2025-05-07T19:56:30.8960622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8962405Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8962942Z ^ 2025-05-07T19:56:30.8963216Z 2025-05-07T19:56:30.8964639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:56:30.8966444Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8966981Z ^ 2025-05-07T19:56:30.8967423Z 2025-05-07T19:56:30.8968827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8970738Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8971243Z ^ 2025-05-07T19:56:30.8971529Z 2025-05-07T19:56:30.8972942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8974916Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.8975617Z ^ 2025-05-07T19:56:30.8975912Z 2025-05-07T19:56:30.8976332Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.8976948Z 2025-05-07T19:56:30.8978356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8980079Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8980614Z ^ 2025-05-07T19:56:30.8980884Z 2025-05-07T19:56:30.8982199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8983731Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8984169Z ^ 2025-05-07T19:56:30.8984428Z 2025-05-07T19:56:30.8985801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8987354Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8987805Z ^ 2025-05-07T19:56:30.8988074Z 2025-05-07T19:56:30.8989278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8991157Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:30.8991721Z ^ 2025-05-07T19:56:30.8991948Z 2025-05-07T19:56:30.8992315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.8992804Z 2025-05-07T19:56:30.8993947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8995398Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8995870Z ^ 2025-05-07T19:56:30.8996261Z 2025-05-07T19:56:30.8997514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.8999374Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.8999885Z ^ 2025-05-07T19:56:30.9000140Z 2025-05-07T19:56:30.9001225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.9002634Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.9003062Z ^ 2025-05-07T19:56:30.9003482Z 2025-05-07T19:57:05.6153915Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:07.7256704Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:57:09.2147226Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:09.9608750Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:57:11.9627779Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 
-gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:14.2800598Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:15.0014623Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:16.3853545Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:18.0323431Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:19.2680447Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:26.9013960Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 
-gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:30.2389966Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:32.7302657Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:33.7667577Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:42.4629732Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:42.4649258Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:42.4650552Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:42.4651068Z ^ 2025-05-07T19:57:42.4651275Z 2025-05-07T19:57:42.4651598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.4652113Z 2025-05-07T19:57:42.4653068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:42.4654336Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:42.4654757Z ^ 2025-05-07T19:57:42.4654980Z 2025-05-07T19:57:42.4655302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.4655782Z 2025-05-07T19:57:42.4656735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:42.4658235Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:42.4658675Z ^ 2025-05-07T19:57:42.4658870Z 2025-05-07T19:57:42.4659193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.4659815Z 2025-05-07T19:57:42.4660775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:42.4662041Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:42.4662447Z ^ 2025-05-07T19:57:42.4662652Z 2025-05-07T19:57:42.4663002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:42.4663467Z 2025-05-07T19:57:46.2544201Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:46.6591582Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:48.7021213Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:50.7182847Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:53.0950713Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:53.0971883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.0973689Z int error_code = 0; 2025-05-07T19:57:53.0974138Z ^ 2025-05-07T19:57:53.0974371Z 2025-05-07T19:57:53.0974804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.0975436Z 2025-05-07T19:57:53.0976589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.0978131Z int64_t error_value; 2025-05-07T19:57:53.0978596Z ^ 
2025-05-07T19:57:53.0978817Z 2025-05-07T19:57:53.0979986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.0981572Z int error_code = 0; 2025-05-07T19:57:53.0982029Z ^ 2025-05-07T19:57:53.0982334Z 2025-05-07T19:57:53.0983623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.0985256Z int64_t error_value; 2025-05-07T19:57:53.0985712Z ^ 2025-05-07T19:57:53.0985915Z 2025-05-07T19:57:53.0987091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.0988638Z int error_code = 0; 2025-05-07T19:57:53.0988960Z ^ 2025-05-07T19:57:53.0989148Z 2025-05-07T19:57:53.0990263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.0991938Z int64_t error_value; 2025-05-07T19:57:53.0992341Z ^ 2025-05-07T19:57:53.0992550Z 2025-05-07T19:57:53.0994010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.0995477Z int error_code = 0; 2025-05-07T19:57:53.0996076Z ^ 2025-05-07T19:57:53.0996271Z 2025-05-07T19:57:53.0997525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 
2025-05-07T19:57:53.0998953Z int64_t error_value; 2025-05-07T19:57:53.0999302Z ^ 2025-05-07T19:57:53.0999476Z 2025-05-07T19:57:53.1000650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1001944Z int error_code = 0; 2025-05-07T19:57:53.1002347Z ^ 2025-05-07T19:57:53.1002550Z 2025-05-07T19:57:53.1002943Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.1003489Z 2025-05-07T19:57:53.1004775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1006258Z int64_t error_value; 2025-05-07T19:57:53.1006619Z ^ 2025-05-07T19:57:53.1006829Z 2025-05-07T19:57:53.1007929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1009491Z int error_code = 0; 2025-05-07T19:57:53.1009906Z ^ 2025-05-07T19:57:53.1010313Z 2025-05-07T19:57:53.1011521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1012867Z int64_t error_value; 2025-05-07T19:57:53.1013247Z ^ 2025-05-07T19:57:53.1013448Z 2025-05-07T19:57:53.1014637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1016154Z int error_code = 0; 2025-05-07T19:57:53.1016586Z ^ 2025-05-07T19:57:53.1016797Z 2025-05-07T19:57:53.1018011Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1019511Z int64_t error_value; 2025-05-07T19:57:53.1019909Z ^ 2025-05-07T19:57:53.1020106Z 2025-05-07T19:57:53.1021247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1022675Z int error_code = 0; 2025-05-07T19:57:53.1023106Z ^ 2025-05-07T19:57:53.1023298Z 2025-05-07T19:57:53.1024544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1026144Z int64_t error_value; 2025-05-07T19:57:53.1026603Z ^ 2025-05-07T19:57:53.1026839Z 2025-05-07T19:57:53.1028022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1029728Z int error_code = 0; 2025-05-07T19:57:53.1030142Z ^ 2025-05-07T19:57:53.1030553Z 2025-05-07T19:57:53.1030963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.1031598Z 2025-05-07T19:57:53.1032931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1034696Z int64_t error_value; 2025-05-07T19:57:53.1035141Z ^ 2025-05-07T19:57:53.1035366Z 2025-05-07T19:57:53.1036824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable 
"error_code" was declared but never referenced 2025-05-07T19:57:53.1038508Z int error_code = 0; 2025-05-07T19:57:53.1038945Z ^ 2025-05-07T19:57:53.1039144Z 2025-05-07T19:57:53.1040429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1042038Z int64_t error_value; 2025-05-07T19:57:53.1042453Z ^ 2025-05-07T19:57:53.1042692Z 2025-05-07T19:57:53.1043853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1045364Z int error_code = 0; 2025-05-07T19:57:53.1045778Z ^ 2025-05-07T19:57:53.1046002Z 2025-05-07T19:57:53.1047488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1048918Z int64_t error_value; 2025-05-07T19:57:53.1049610Z ^ 2025-05-07T19:57:53.1049831Z 2025-05-07T19:57:53.1051068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1052717Z int error_code = 0; 2025-05-07T19:57:53.1053172Z ^ 2025-05-07T19:57:53.1053371Z 2025-05-07T19:57:53.1054648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1056337Z int64_t error_value; 2025-05-07T19:57:53.1056786Z ^ 2025-05-07T19:57:53.1057008Z 2025-05-07T19:57:53.1058298Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1060013Z int error_code = 0; 2025-05-07T19:57:53.1060435Z ^ 2025-05-07T19:57:53.1060675Z 2025-05-07T19:57:53.1061105Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.1061715Z 2025-05-07T19:57:53.1062936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1064461Z int64_t error_value; 2025-05-07T19:57:53.1064879Z ^ 2025-05-07T19:57:53.1065077Z 2025-05-07T19:57:53.1066238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1067853Z int error_code = 0; 2025-05-07T19:57:53.1068276Z ^ 2025-05-07T19:57:53.1068457Z 2025-05-07T19:57:53.1069488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1071196Z int64_t error_value; 2025-05-07T19:57:53.1071600Z ^ 2025-05-07T19:57:53.1071827Z 2025-05-07T19:57:53.1073041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1075700Z int error_code = 0; 2025-05-07T19:57:53.1076259Z ^ 2025-05-07T19:57:53.1076450Z 2025-05-07T19:57:53.1077736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable 
"error_value" was declared but never referenced 2025-05-07T19:57:53.1079297Z int64_t error_value; 2025-05-07T19:57:53.1079702Z ^ 2025-05-07T19:57:53.1079935Z 2025-05-07T19:57:53.1081231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:53.1082739Z int error_code = 0; 2025-05-07T19:57:53.1083141Z ^ 2025-05-07T19:57:53.1083343Z 2025-05-07T19:57:53.1084511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:53.1086098Z int64_t error_value; 2025-05-07T19:57:53.1086478Z ^ 2025-05-07T19:57:53.1086652Z 2025-05-07T19:57:56.4635240Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:58.2938036Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:58:00.8630148Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:58:02.1982400Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:58:04.1526721Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:08.5678868Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:14.4690115Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:14.6633892Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:15.7213348Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:17.1259915Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:21.9234394Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:21.9257176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.9259263Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.9259838Z ^ 2025-05-07T19:58:21.9260096Z 2025-05-07T19:58:21.9260542Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:21.9261232Z 2025-05-07T19:58:21.9262610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.9264427Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.9264964Z ^ 2025-05-07T19:58:21.9265243Z 2025-05-07T19:58:21.9265698Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:21.9266336Z 2025-05-07T19:58:21.9267621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.9269284Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.9269822Z ^ 2025-05-07T19:58:21.9270076Z 2025-05-07T19:58:21.9270513Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:21.9271176Z 2025-05-07T19:58:21.9272500Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:21.9274231Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:21.9274777Z ^ 2025-05-07T19:58:21.9275026Z 2025-05-07T19:58:21.9275460Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:21.9276535Z 2025-05-07T19:58:25.2130892Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:30.8791633Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:34.8596601Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:38.6318171Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:39.0704789Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:41.7358782Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:42.6939329Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:45.3467106Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:48.5880957Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:48.5902637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5904600Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5905164Z ^ 2025-05-07T19:58:48.5905448Z 2025-05-07T19:58:48.5905900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.5906551Z 2025-05-07T19:58:48.5907976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5910073Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5910727Z ^ 2025-05-07T19:58:48.5910983Z 2025-05-07T19:58:48.5912348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5914185Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5914757Z ^ 2025-05-07T19:58:48.5915039Z 2025-05-07T19:58:48.5916674Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5918478Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5919055Z ^ 2025-05-07T19:58:48.5919306Z 2025-05-07T19:58:48.5919708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.5920344Z 2025-05-07T19:58:48.5921818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5923732Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5924253Z ^ 2025-05-07T19:58:48.5924556Z 2025-05-07T19:58:48.5926263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5928232Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5928792Z ^ 2025-05-07T19:58:48.5929064Z 2025-05-07T19:58:48.5930549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5932392Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5932932Z ^ 2025-05-07T19:58:48.5933215Z 2025-05-07T19:58:48.5933638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:48.5934248Z 2025-05-07T19:58:48.5935748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:58:48.5937671Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5938263Z ^ 2025-05-07T19:58:48.5938573Z 2025-05-07T19:58:48.5939938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5941784Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5942300Z ^ 2025-05-07T19:58:48.5942598Z 2025-05-07T19:58:48.5944002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5945896Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5946793Z ^ 2025-05-07T19:58:48.5947067Z 2025-05-07T19:58:48.5947485Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:48.5948367Z 2025-05-07T19:58:48.5949963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5951978Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5952564Z ^ 2025-05-07T19:58:48.5952853Z 2025-05-07T19:58:48.5954158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5955976Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5956599Z ^ 2025-05-07T19:58:48.5956930Z 2025-05-07T19:58:48.5958368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning 
#186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5960221Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5960764Z ^ 2025-05-07T19:58:48.5961078Z 2025-05-07T19:58:48.5961471Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:58:48.5962078Z 2025-05-07T19:58:48.5963495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5965347Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5965901Z ^ 2025-05-07T19:58:48.5966169Z 2025-05-07T19:58:48.5967940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:48.5969909Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:48.5970441Z ^ 2025-05-07T19:58:48.5970771Z 2025-05-07T19:58:50.3132071Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:50.9277620Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:51.9343766Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:55.3705442Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:56.4961838Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:57.5008082Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:57.5388579Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:01.4695507Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:01.7132070Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:03.6229478Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:08.4119453Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:11.0300463Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:12.5419083Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:13.0080714Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:19.1475048Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:19.7501163Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:19.8699837Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:20.0468887Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:25.4772889Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:25.7784449Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:26.5374048Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:27.6185170Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:30.9720820Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:36.5359832Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:37.5335012Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:38.3437596Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:39.4801461Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:39.8216105Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:43.9512707Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:43.9535527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9537586Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:43.9538325Z ^ 2025-05-07T19:59:43.9538649Z 2025-05-07T19:59:43.9539059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:43.9539648Z 2025-05-07T19:59:43.9541053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9542878Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9543429Z ^ 2025-05-07T19:59:43.9543696Z 2025-05-07T19:59:43.9547519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9549667Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9550207Z ^ 2025-05-07T19:59:43.9550476Z 2025-05-07T19:59:43.9551917Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9553679Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9554203Z ^ 2025-05-07T19:59:43.9554461Z 2025-05-07T19:59:43.9555954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9557969Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:43.9558741Z ^ 2025-05-07T19:59:43.9559022Z 2025-05-07T19:59:43.9559420Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:43.9560017Z 2025-05-07T19:59:43.9561526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9563336Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9563857Z ^ 2025-05-07T19:59:43.9564114Z 2025-05-07T19:59:43.9565706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9567494Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9568062Z ^ 2025-05-07T19:59:43.9568575Z 2025-05-07T19:59:43.9570035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9572104Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9572683Z ^ 
2025-05-07T19:59:43.9572932Z 2025-05-07T19:59:43.9574306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9576209Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:43.9576984Z ^ 2025-05-07T19:59:43.9577288Z 2025-05-07T19:59:43.9577708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:43.9578355Z 2025-05-07T19:59:43.9579899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9581699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9582296Z ^ 2025-05-07T19:59:43.9582567Z 2025-05-07T19:59:43.9583991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9585864Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9586440Z ^ 2025-05-07T19:59:43.9586728Z 2025-05-07T19:59:43.9588534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9590365Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9590905Z ^ 2025-05-07T19:59:43.9591160Z 2025-05-07T19:59:43.9592602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9594720Z if (output_d >= 0 && 
output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:43.9595353Z ^ 2025-05-07T19:59:43.9595628Z 2025-05-07T19:59:43.9596174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:43.9596751Z 2025-05-07T19:59:43.9598092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9599711Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9600239Z ^ 2025-05-07T19:59:43.9600501Z 2025-05-07T19:59:43.9601988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9603718Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9604276Z ^ 2025-05-07T19:59:43.9604596Z 2025-05-07T19:59:43.9606023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9607788Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9608328Z ^ 2025-05-07T19:59:43.9608774Z 2025-05-07T19:59:43.9610224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9612311Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:43.9613026Z ^ 2025-05-07T19:59:43.9613349Z 2025-05-07T19:59:43.9613745Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:43.9614315Z 2025-05-07T19:59:43.9615739Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9617391Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9617930Z ^ 2025-05-07T19:59:43.9618213Z 2025-05-07T19:59:43.9619711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9621472Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9622014Z ^ 2025-05-07T19:59:43.9622310Z 2025-05-07T19:59:43.9623742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:43.9625471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:43.9625998Z ^ 2025-05-07T19:59:43.9626258Z 2025-05-07T19:59:49.3626531Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:56.1780891Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:56.1810827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1813644Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.1814652Z ^ 2025-05-07T19:59:56.1815035Z 2025-05-07T19:59:56.1815608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.1816528Z 2025-05-07T19:59:56.1818582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1821280Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1822033Z ^ 2025-05-07T19:59:56.1822452Z 2025-05-07T19:59:56.1824364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1827158Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1827839Z ^ 2025-05-07T19:59:56.1828175Z 2025-05-07T19:59:56.1830077Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1832475Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1833168Z ^ 2025-05-07T19:59:56.1833518Z 2025-05-07T19:59:56.1835590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1838405Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.1839338Z ^ 2025-05-07T19:59:56.1839698Z 2025-05-07T19:59:56.1840271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.1841070Z 2025-05-07T19:59:56.1842953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1844731Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1845234Z ^ 2025-05-07T19:59:56.1845497Z 2025-05-07T19:59:56.1847314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1849174Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1849671Z ^ 2025-05-07T19:59:56.1849941Z 2025-05-07T19:59:56.1851347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1853147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1853670Z ^ 
2025-05-07T19:59:56.1853929Z 2025-05-07T19:59:56.1855391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1857328Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.1858081Z ^ 2025-05-07T19:59:56.1858356Z 2025-05-07T19:59:56.1858803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.1859413Z 2025-05-07T19:59:56.1860818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1862659Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1863182Z ^ 2025-05-07T19:59:56.1863447Z 2025-05-07T19:59:56.1864904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1866795Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1867297Z ^ 2025-05-07T19:59:56.1867575Z 2025-05-07T19:59:56.1868993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1871121Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1871629Z ^ 2025-05-07T19:59:56.1871896Z 2025-05-07T19:59:56.1873406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1875397Z if 
(output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.1876280Z ^ 2025-05-07T19:59:56.1876543Z 2025-05-07T19:59:56.1877008Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.1877659Z 2025-05-07T19:59:56.1879134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1880977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1881501Z ^ 2025-05-07T19:59:56.1881809Z 2025-05-07T19:59:56.1883174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1884952Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1885438Z ^ 2025-05-07T19:59:56.1885712Z 2025-05-07T19:59:56.1887200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1889071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1889595Z ^ 2025-05-07T19:59:56.1889861Z 2025-05-07T19:59:56.1891358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1893406Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:56.1894150Z ^ 2025-05-07T19:59:56.1894427Z 2025-05-07T19:59:56.1894865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:56.1895491Z 2025-05-07T19:59:56.1897004Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1898854Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1899351Z ^ 2025-05-07T19:59:56.1899631Z 2025-05-07T19:59:56.1901055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1902730Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1903261Z ^ 2025-05-07T19:59:56.1903535Z 2025-05-07T19:59:56.1904949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:56.1906820Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:56.1907313Z ^ 2025-05-07T19:59:56.1907567Z 2025-05-07T19:59:57.3447800Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:59:59.7474073Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:09.5934303Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:43.4197290Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:50.5960393Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode 
arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:54.0578162Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:54.6264275Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:55.0475615Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:00:55.2294257Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:56.8469676Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:00:59.2446356Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:01:02.3728653Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:07.2447236Z [335/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 
2025-05-07T20:01:09.8251591Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:01:09.8404259Z [337/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:01:10.7383285Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:12.9727032Z [339/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:14.8697398Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:17.5028880Z [341/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:19.8302683Z [342/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:20.0092683Z [343/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:20.4150492Z [344/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:20.6387150Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:22.8024665Z [346/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:24.5966333Z [347/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:27.2480359Z [348/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:29.0623044Z [349/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:29.7229933Z [350/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:29.9860683Z [351/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 
2025-05-07T20:01:32.2842757Z [352/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:33.0250622Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:36.8710545Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:37.2552584Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:37.5597265Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:41.7530671Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:45.6981981Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:50.2466218Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:58.7663032Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:02:03.3713808Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:02:03.7350621Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:02:04.3343850Z [363/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:05.0467913Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:10.1823460Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 
2025-05-07T20:02:12.2110456Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:13.1397599Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:18.4093989Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:21.2655211Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:29.6100937Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:34.7344111Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:35.1604150Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:47.4613161Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:47.7587986Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:54.7917918Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:03:00.3139044Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:03:00.9733543Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:06.7083274Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:03:11.3531679Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:20.8274466Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:22.3656290Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:22.7615777Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:26.3857566Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:30.3024493Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:30.3862857Z [385/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:32.1141769Z [386/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:32.3172747Z [387/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:33.4636704Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:44.0176636Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:49.4071081Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:50.2007057Z [391/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:50.7677836Z [392/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:52.7477431Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:53.8643888Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:54.5455837Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:55.1959998Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:55.3198802Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:59.3162592Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 
-gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:00.6028160Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 
2025-05-07T20:04:00.9588499Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:01.0973339Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:03.5741355Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:05.0203383Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:15.4545874Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:16.1620471Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:16.4002985Z [406/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:04:18.9244594Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:19.1704887Z [408/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:20.2911206Z [409/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:04:21.3656383Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:21.3936239Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:21.6503381Z [412/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:04:21.9548889Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:22.1882417Z [414/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:22.7617215Z [415/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:04:23.7367943Z [416/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:24.6925626Z [417/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:24.7980742Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:25.0787157Z [419/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:25.1486086Z [420/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:25.3026047Z [421/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:26.0198729Z [422/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:27.8326586Z [423/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:28.1033649Z [424/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d 
-x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:29.1418526Z [425/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:29.8969738Z [426/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:30.6094261Z [427/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:30.6694593Z [428/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:30.8432336Z [429/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 
2025-05-07T20:04:30.9538331Z [430/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:31.5938274Z [431/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:31.9352092Z [432/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:31.9650110Z [433/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:32.1325917Z [434/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:32.2452497Z [435/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:33.3373928Z [436/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:33.4684067Z [437/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:34.2410782Z [438/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:34.2717211Z [439/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:34.3266116Z [440/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:34.5119584Z [441/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:34.5495187Z [442/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:34.6487001Z [443/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:34.7196199Z [444/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:34.8614483Z [445/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:35.6418335Z [446/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:35.9748176Z [447/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:37.4194360Z [448/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:37.7160410Z [449/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:37.7899113Z [450/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:38.7047534Z [451/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:39.0067982Z [452/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:39.7817034Z [453/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:39.9430815Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:40.7489710Z [455/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:40.7655472Z [456/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:41.0577297Z [457/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:42.3791095Z [458/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:44.2836726Z [459/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:45.0561282Z [460/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:47.8189169Z [461/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:48.4418031Z [462/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:48.7765957Z [463/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:49.1721128Z [464/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:49.3255725Z [465/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:50.5283787Z [466/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:50.5603943Z [467/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:52.0291144Z [468/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:52.2814241Z [469/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:53.1256639Z [470/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:53.3389846Z [471/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:53.6129553Z [472/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:53.7401984Z [473/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:54.3369036Z [474/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD 
-MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:54.8689861Z [475/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:54.9730884Z [476/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:55.1068120Z [477/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:56.0995575Z [478/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:56.1479068Z [479/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:57.5777196Z [480/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:58.1913970Z [481/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:05:01.8429105Z [482/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:01.8451104Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.8452923Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.8453565Z ^ 2025-05-07T20:05:01.8453822Z 2025-05-07T20:05:01.8454256Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:05:01.8454968Z 2025-05-07T20:05:01.8456338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.8458367Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.8458888Z ^ 2025-05-07T20:05:01.8459312Z 2025-05-07T20:05:01.8459761Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:05:01.8460421Z 2025-05-07T20:05:01.8461820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.8463654Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.8464198Z ^ 2025-05-07T20:05:01.8464464Z 2025-05-07T20:05:01.8464931Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:05:01.8465624Z 2025-05-07T20:05:01.8466991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.8468803Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.8469364Z ^ 2025-05-07T20:05:01.8469634Z 2025-05-07T20:05:01.8470104Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:05:01.8470751Z 2025-05-07T20:05:03.3614093Z [483/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:05:04.0270743Z [484/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:05:04.6081162Z [485/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:05:05.2132958Z [486/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:05:05.8438447Z [487/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:05:06.6933927Z [488/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:06.9651192Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:06.9674704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:06.9676561Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:06.9677112Z ^ 2025-05-07T20:05:06.9677377Z 2025-05-07T20:05:06.9677828Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.9678510Z 2025-05-07T20:05:06.9679876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 
2025-05-07T20:05:06.9681582Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:06.9682178Z ^ 2025-05-07T20:05:06.9682423Z 2025-05-07T20:05:06.9682889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.9683521Z 2025-05-07T20:05:06.9684848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:06.9686433Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:06.9687001Z ^ 2025-05-07T20:05:06.9687266Z 2025-05-07T20:05:06.9687721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.9688349Z 2025-05-07T20:05:06.9689893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:06.9691640Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:06.9692150Z ^ 2025-05-07T20:05:06.9692409Z 2025-05-07T20:05:06.9692858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:06.9693468Z 2025-05-07T20:05:08.5043417Z [490/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o 
fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:05:08.9034297Z [491/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:05:09.3426234Z [492/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:05:09.6657478Z [493/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:05:10.1082170Z [494/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:05:10.4400118Z [495/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:05:10.5869288Z [496/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:10.9462623Z [497/608] 
/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:05:11.3505260Z [498/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:05:11.5251539Z [499/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:05:11.8738205Z [500/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:05:13.6413970Z [501/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:05:13.8304747Z [502/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:05:14.9127213Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:19.4329631Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:22.6292750Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:23.7794866Z [506/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:05:29.4536413Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:30.3473370Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:34.0942508Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:38.0260643Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 
2025-05-07T20:05:39.4395673Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:42.5597779Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:46.2470842Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:49.4319543Z [514/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:50.0256204Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:51.2348482Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:53.7710662Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:06:00.1728379Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:06:05.5823914Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:06:11.0723933Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:06:12.7445829Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:06:15.0200084Z [522/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:06:16.8179809Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:06:16.8868449Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:19.9721832Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:21.8791936Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:25.4942519Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:06:25.6787538Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:06:27.9515555Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:28.3626290Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:30.3047191Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:30.7699262Z [532/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:30.8196070Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:06:30.8390636Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:30.8478406Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:06:30.8480450Z ################################################################################ 2025-05-07T20:06:30.8481121Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:30.8482380Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:06:30.8483266Z Removing all RPATHs ... 2025-05-07T20:06:30.8483753Z ################################################################################ 2025-05-07T20:06:30.8752564Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 1 2025-05-07T20:06:30.8754773Z ################################################################################ 2025-05-07T20:06:30.8755436Z [CMAKE] Running post-build script ... 2025-05-07T20:06:30.8756413Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:06:30.8757311Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:30.8757976Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:30.8758744Z ################################################################################ 2025-05-07T20:06:30.9586234Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:30.9588508Z ################################################################################ 2025-05-07T20:06:30.9589176Z [CMAKE] Running post-build script ... 2025-05-07T20:06:30.9590157Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:30.9591182Z Removing all RPATHs ... 
2025-05-07T20:06:30.9591703Z ################################################################################ 2025-05-07T20:06:31.7543603Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:32.3328126Z [539/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:32.3369527Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:32.3371768Z ################################################################################ 2025-05-07T20:06:32.3372404Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:32.3373403Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:32.3374401Z Removing all RPATHs ... 2025-05-07T20:06:32.3374888Z ################################################################################ 2025-05-07T20:06:32.6090102Z [541/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:32.6685333Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:32.6687486Z ################################################################################ 2025-05-07T20:06:32.6687965Z [CMAKE] Running post-build script ... 2025-05-07T20:06:32.6688790Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:32.6689605Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:32.6690134Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:32.6690729Z ################################################################################ 2025-05-07T20:06:32.7678250Z [543/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:32.7840654Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:32.7842517Z ################################################################################ 2025-05-07T20:06:32.7842990Z [CMAKE] Running post-build script ... 2025-05-07T20:06:32.7843801Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:32.7844743Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:32.7845317Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:32.7846172Z ################################################################################ 2025-05-07T20:06:32.9537328Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:32.9540126Z ################################################################################ 2025-05-07T20:06:32.9540876Z [CMAKE] Running post-build script ... 2025-05-07T20:06:32.9542071Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:32.9543255Z Removing all RPATHs ... 2025-05-07T20:06:32.9543845Z ################################################################################ 2025-05-07T20:06:33.0815973Z [546/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:33.0833516Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:33.0835367Z ################################################################################ 2025-05-07T20:06:33.0836005Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:33.0836814Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:33.0837767Z Removing all RPATHs ... 2025-05-07T20:06:33.0838153Z ################################################################################ 2025-05-07T20:06:33.1523012Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:33.1525001Z ################################################################################ 2025-05-07T20:06:33.1525452Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.1526234Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:33.1527002Z Removing all RPATHs ... 2025-05-07T20:06:33.1527396Z ################################################################################ 2025-05-07T20:06:33.4120874Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:33.4149271Z [550/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:33.4151789Z ################################################################################ 
2025-05-07T20:06:33.4152324Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.4153228Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:33.4154268Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:33.4154820Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.4155389Z ################################################################################ 2025-05-07T20:06:33.4449753Z [551/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:33.4454464Z ################################################################################ 2025-05-07T20:06:33.4454982Z [CMAKE] Running post-build script ... 2025-05-07T20:06:33.4455873Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:33.4456768Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:33.4457338Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:33.4457936Z ################################################################################ 2025-05-07T20:06:35.7175927Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:36.8708659Z [553/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:36.8710996Z ################################################################################ 2025-05-07T20:06:36.8711646Z [CMAKE] Running post-build script ... 2025-05-07T20:06:36.8712660Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:36.8713954Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:36.8714606Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:36.8715347Z ################################################################################ 2025-05-07T20:06:37.0372856Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:37.3920838Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:37.7609111Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:38.2887555Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:38.6171865Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:39.3350462Z [559/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:39.3351823Z ################################################################################ 2025-05-07T20:06:39.3352217Z [CMAKE] Running post-build script ... 2025-05-07T20:06:39.3352857Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:39.3353496Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:39.3353911Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:39.3354348Z ################################################################################ 2025-05-07T20:06:39.4102122Z [560/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 
-L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:39.6486928Z [561/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:39.6488285Z ################################################################################ 2025-05-07T20:06:39.6488701Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:39.6489304Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:39.6489973Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:39.6490389Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:39.6490820Z ################################################################################ 2025-05-07T20:06:40.9735483Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:42.8487615Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:46.2255059Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:46.3511102Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:49.0292652Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 
-gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:52.8893184Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:54.6636534Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:59.1156374Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:59.9343379Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:07:00.3503281Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:07:00.6968562Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:07:07.3450216Z [573/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:07:12.8692389Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:07:13.6894682Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:07:16.5331131Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:07:16.5342185Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:16.5343047Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:16.5343410Z ^ 2025-05-07T20:07:16.5343594Z 2025-05-07T20:07:16.5343840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:16.5344219Z 2025-05-07T20:07:16.5344749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:16.5345546Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:16.5345876Z ^ 2025-05-07T20:07:16.5346049Z 2025-05-07T20:07:16.5346320Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:16.5346911Z 2025-05-07T20:07:16.5347501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:16.5348280Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:16.5348633Z ^ 2025-05-07T20:07:16.5348803Z 2025-05-07T20:07:16.5349064Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:16.5349416Z 2025-05-07T20:07:16.5349945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:16.5350737Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:16.5351062Z ^ 2025-05-07T20:07:16.5351251Z 2025-05-07T20:07:16.5351491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:16.5351840Z 2025-05-07T20:07:16.6450744Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:07:23.6545471Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:07:29.7816876Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:07:30.4528858Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:30.7595284Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:33.2688592Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:34.2098450Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:35.6601787Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:42.0246568Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:42.8950070Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:43.1510025Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:44.4588307Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:44.6248547Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:45.7559295Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:46.6549841Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:47.7411328Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:48.8665981Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:50.6282480Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:51.3865062Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:07:51.5868148Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:51.5869751Z ################################################################################ 2025-05-07T20:07:51.5870203Z [CMAKE] Running post-build script ... 2025-05-07T20:07:51.5870765Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:51.5871338Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:07:51.5871717Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:51.5872160Z ################################################################################ 2025-05-07T20:08:59.3392542Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:09:05.8605517Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:09:06.1009194Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:09:07.9338212Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" 
/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:09:08.5530622Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:09:08.5770121Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:09:08.7516481Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:09:08.7517861Z ################################################################################ 2025-05-07T20:09:08.7518283Z [CMAKE] Running post-build script ... 
2025-05-07T20:09:08.7518924Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:08.7519608Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:08.7519987Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:08.7520441Z ################################################################################ 2025-05-07T20:09:09.2450278Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:09:09.2451713Z ################################################################################ 2025-05-07T20:09:09.2452101Z [CMAKE] Running post-build script ... 2025-05-07T20:09:09.2452900Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:09.2453552Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:09.2454004Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:09.2454529Z ################################################################################ 2025-05-07T20:09:09.7039258Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:09:09.7040699Z ################################################################################ 2025-05-07T20:09:09.7041082Z [CMAKE] Running post-build script ... 2025-05-07T20:09:09.7041769Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:09.7042462Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:09:09.7042871Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:09.7043343Z ################################################################################ 2025-05-07T20:09:16.4850516Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:09:19.1298153Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:09:19.1299723Z ################################################################################ 2025-05-07T20:09:19.1300137Z [CMAKE] Running post-build script ... 2025-05-07T20:09:19.1300780Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:19.1301440Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:09:19.1301818Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:19.1302264Z ################################################################################ 2025-05-07T20:09:19.1303274Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:09:19.1351273Z -- Install configuration: "Release" 2025-05-07T20:09:19.1351923Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:09:19.1377723Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:09:19.1378834Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:09:19.1406833Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:09:19.1411051Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:09:19.1435214Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:09:19.1452275Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:09:19.1456404Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:09:19.1457376Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:09:19.1478937Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 
2025-05-07T20:09:19.1479998Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:09:19.1481228Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:19.1482363Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:19.1483404Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:19.1484658Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:19.1485758Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:19.1486798Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:19.1487898Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:19.1489138Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:19.1490373Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:19.1491594Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:19.1492784Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:19.1494037Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:19.1495341Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:19.1496678Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:19.1498034Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:19.1499359Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:19.1500753Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:19.1502025Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:19.1503256Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:19.1504462Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:19.1505817Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:19.1507147Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:19.1508254Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:19.1517714Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:19.1574625Z 2025-05-07T20:09:19.1627405Z 2025-05-07T20:09:19.1628940Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:19.1631603Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:19.1634226Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:19.1636188Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:19.1637139Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:19.1638361Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:19.1639392Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:19.1640259Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:19.1641145Z copying fbgemm_gpu/runtime_monitor.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:19.1641959Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:19.1642861Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:19.1643942Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:19.1645094Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:19.1646124Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:19.1647361Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:19.1648729Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:19.1650098Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:19.1651405Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:19.1652787Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:19.1654131Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:19.1655209Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:19.1656037Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:19.1656685Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:19.1657444Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:19.1658408Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:19.1659174Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:19.1660605Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:19.1661408Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:19.1662271Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:19.1663202Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:19.1664232Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:19.1665396Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:19.1666461Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:19.1667341Z copying fbgemm_gpu/docs/sparse_ops.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:19.1668198Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:19.1668926Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:19.1669719Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:19.1670659Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:19.1671433Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:19.1672170Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:19.1672862Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:19.1673566Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:19.1674286Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:19.1675040Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:19.1676009Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:19.1676857Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:19.1677786Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:19.1678596Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:19.1679306Z copying fbgemm_gpu/utils/__init__.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:19.1680177Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:19.1681052Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:19.1681918Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:19.1682729Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:19.1683457Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:19.1684335Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:19.1685353Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:19.1686143Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:19.1687052Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:19.1687883Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1688661Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:19.1689604Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:19.1690754Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:19.1692066Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:19.1693269Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:19.1694454Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:19.1695764Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:19.1697268Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:19.1698752Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:19.1700124Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:19.1701620Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:19.1702993Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:19.1704282Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:19.1705360Z creating directory 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.1706152Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:19.1707080Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:19.1708068Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:19.1708979Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:19.1710077Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:19.1711240Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:19.1712274Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:19.1713297Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:19.1714401Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:19.1715573Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:19.1716749Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:19.1717550Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:19.1718316Z copying fbgemm_gpu/tbe/cache/__init__.py 
-> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:19.1719407Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:19.1720336Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.1721108Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:19.1721998Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:19.1722883Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:19.1723825Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:19.1724984Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.1725766Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:19.1726696Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:19.1727626Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:19.1728640Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:19.1729803Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:19.1730720Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:19.1731527Z copying fbgemm_gpu/tbe/stats/__init__.py -> 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:19.1732580Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:19.1733497Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:19.1734345Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:19.1735549Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:19.1736555Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:19.1737395Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:19.1738493Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:19.1740187Z 2025-05-07T20:09:19.1836976Z INFO:root:running bdist_wheel 2025-05-07T20:09:19.1871529Z INFO:root:running build 2025-05-07T20:09:19.1872412Z INFO:root:running build_py 2025-05-07T20:09:19.1875395Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1878520Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1882375Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1885747Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1886966Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1888314Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1889784Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1891151Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1892463Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1893770Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1895033Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1896357Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1897876Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 
2025-05-07T20:09:19.1899357Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1900752Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1902175Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1903643Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1905185Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1906767Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1910598Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1912184Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1913959Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 
2025-05-07T20:09:19.1915327Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.1917728Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:19.1918905Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:19.1920608Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:19.1923499Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1924662Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1926343Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1927955Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1929559Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1931171Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1932765Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1934209Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1935710Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1938025Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:19.1940655Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:19.1941867Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:19.1943632Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:19.1945746Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:19.1947220Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:19.1949881Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:19.1950931Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:19.1953326Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:19.1954674Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:19.1956390Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:19.1958139Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:19.1959920Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:19.1962389Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:19.1963539Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:19.1965623Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:19.1967006Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:19.1968512Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 
2025-05-07T20:09:19.1970673Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:19.1971875Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:19.1973494Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:19.1975983Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:19.1977100Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:19.1978793Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:19.1981987Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1983158Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1984854Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1986634Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1988430Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1990026Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1991594Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1993390Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1995211Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1997475Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.1999342Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.2001169Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.2002842Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.2004542Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:19.2007505Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2008713Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2010438Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2012210Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2013863Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2015495Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2017188Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2018829Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2020838Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2022771Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2024400Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2025943Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:19.2028168Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:19.2029295Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:19.2031083Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:19.2033364Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.2034529Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.2036239Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.2037952Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.2040307Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:19.2043371Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2044594Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2046302Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2048140Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2049739Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2051352Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:19.2053697Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:19.2054917Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:19.2056693Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:19.2058818Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:19.2060115Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:19.2061833Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:19.2064112Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:19.2065394Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:19.2067245Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:19.2119799Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.2151539Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.2352366Z 
INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:19.3423978Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.7935885Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.7940124Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.9244276Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.9363495Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:22.9577756Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:23.0277285Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:25.7644479Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:25.8210229Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:33.5937196Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:35.3793779Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:37.2170684Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:37.5917241Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:37.6204128Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:37.8918246Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8922773Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8927501Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8939067Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8948291Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8957873Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8969772Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8985076Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.8997498Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9005427Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9014997Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9026757Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9032919Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9045249Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9055459Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:37.9064043Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:37.9067338Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:37.9077315Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:37.9084657Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:37.9108482Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6755633Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6759764Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6763423Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6764717Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6766007Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6767455Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6768823Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6770071Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6771352Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6772606Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6773927Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6775445Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6776875Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6778214Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6780026Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6781668Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6783272Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6786435Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6789571Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6791527Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6793359Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6795111Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:09:38.6797371Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:38.6799740Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:09:38.6801416Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6804260Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6805818Z 
INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6807710Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6809604Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6811998Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6814784Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6816100Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6817850Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:09:38.6819995Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:38.6823253Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:09:38.6824760Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:09:38.6826527Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:09:38.6828903Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:38.6830445Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:38.6832053Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:38.6833817Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:09:38.6835452Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:38.6837232Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:38.6838869Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:38.6840385Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:09:38.6842003Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:38.6843616Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:09:38.6845412Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:38.6847218Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:09:38.6849254Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6850934Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6852597Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6854253Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6855836Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6857536Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6859288Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6861192Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6862840Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6864705Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6866714Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6868690Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6870536Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.6872043Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6873857Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6875299Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6876924Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6878699Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6880314Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6885275Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6888712Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6890148Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6891664Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6893112Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.6894454Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:38.6896058Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:09:38.6897483Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.6898792Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.6900155Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.6901571Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.6902940Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.6904336Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py 
-> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.6905898Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.6907466Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.6909034Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.6910703Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:38.6912477Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:09:38.6914110Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:38.6915995Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:38.6917749Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:38.6919300Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:09:38.6938979Z INFO:skbuild:copied 90 files 2025-05-07T20:09:38.6939896Z INFO:root:running build_ext 2025-05-07T20:09:38.6941219Z INFO:root:installing to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:38.6942601Z INFO:root:running install 2025-05-07T20:09:38.6994076Z INFO:root:running install_lib 2025-05-07T20:09:38.6995049Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:38.6995849Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:38.6996747Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:38.6997936Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:38.6999505Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:38.7000648Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:38.7001779Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7003474Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7005033Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/examples.py -> 
_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7006594Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7008204Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7009940Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7011494Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7012984Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7014676Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:38.7015847Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:38.7017034Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:38.7018621Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:38.7019778Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:38.7020533Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:38.7021674Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:38.7023225Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:38.7024387Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:38.7025548Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:38.7027103Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:38.7028284Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7029571Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7031236Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7032929Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7034677Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7036797Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7038530Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7040359Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7042264Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7044168Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> 
_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7046000Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7048026Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7049837Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7051630Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:38.7053274Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:38.7054374Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:38.7055175Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7056347Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7057955Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7059645Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7061355Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7063037Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7064774Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7066415Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7068065Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7069807Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7071586Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7073244Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:38.7074468Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:38.7075722Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:38.7077387Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:38.7078666Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.7079456Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:38.7080712Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:38.7082500Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:38.7084209Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/__init__.py -> 
_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.7085763Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.7087388Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.7089100Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:38.7090268Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7091439Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7092973Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7094549Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7096128Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7097784Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/requests.py 
-> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:38.7098976Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:38.7100143Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:38.7101733Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:38.7103294Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:38.7104397Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:38.7118643Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:38.7120071Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:38.7121843Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:38.7123517Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:38.7125073Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:38.7126654Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:38.7128455Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:38.7129668Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:38.7130802Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:38.7132288Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:38.7133817Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:38.7135351Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:38.7136809Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:38.7138454Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:38.7139972Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:38.7272138Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:38.9980813Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:38.9985335Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.0086735Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.0096495Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.0120327Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.0178307Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> 
_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.2321315Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.2365752Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.7939326Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:39.8799425Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.0787062Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1139190Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1159781Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1366893Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1368530Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1370905Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1373020Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1375069Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1377132Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1379206Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1381309Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> 
_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1383494Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1385633Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1387879Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1390116Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1392215Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1394250Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1396629Z 
INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:40.1398197Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:40.1399946Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:40.1402109Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:40.1403934Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1405389Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1836794Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1838392Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1839890Z INFO:root:copying 
_skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1841436Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1842914Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1844539Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1846281Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1848177Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1849678Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1851169Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1852658Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 
2025-05-07T20:09:40.1854293Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1856035Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1857678Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1859248Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1860877Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1862522Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1864167Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1865861Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1867547Z 
INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1869085Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1870516Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:40.1871379Z INFO:skbuild:copied 125 files 2025-05-07T20:09:40.1871678Z INFO:root:running install_egg_info 2025-05-07T20:09:40.1896163Z INFO:root:running egg_info 2025-05-07T20:09:40.1921543Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:40.1924440Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:40.1926799Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:40.1927781Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:40.2025127Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:40.2065287Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:40.2066233Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.10.egg-info 2025-05-07T20:09:40.2072066Z INFO:root:running install_scripts 2025-05-07T20:09:40.2072481Z INFO:skbuild:copied 0 files 2025-05-07T20:09:42.9149867Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:42.9153836Z INFO:wheel:creating 
'/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-pzkwk5og/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:42.9156026Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:42.9425291Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:42.9437581Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:42.9438493Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:43.1036064Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:43.1164952Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:43.1306541Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:44.8875297Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:45.0904304Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:45.8055697Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:45.9176615Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:46.5103966Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:10:04.3166483Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:10:05.5473550Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:32.9899321Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:35.8011896Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:39.4379543Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:40.0243337Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:40.1980084Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:10:48.9004797Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:10:59.9835059Z 
INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:11:01.4519298Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:11:01.4875285Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:11:01.4878753Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:11:01.4879398Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:11:01.4879910Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:11:01.4882343Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:11:01.4885732Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:11:01.4898134Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:11:01.4900409Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:11:01.4903413Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:11:01.4905140Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:11:01.4906532Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:11:01.4908392Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:11:01.4912440Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:11:01.4937191Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:11:01.4980353Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:11:01.4982268Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:11:01.4984029Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:11:01.4985461Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:11:01.4987062Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:11:01.4988850Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:11:01.4990802Z 
INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:11:01.4992771Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:11:01.4994495Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:11:01.4996560Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:11:01.4999082Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:11:01.5001027Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:11:01.5003357Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:11:01.5005117Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:11:01.5011322Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:11:01.5012852Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:11:01.5015078Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:11:01.5016690Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:11:01.5018778Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:11:01.5021063Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:11:01.5027775Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:11:01.5030063Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:11:01.5032645Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:11:01.5035062Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:11:01.5036896Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:11:01.5039372Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:11:01.5041711Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:11:01.5044987Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:11:01.5049443Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:11:01.5052328Z INFO:wheel:adding 
'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:11:01.5054584Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:11:01.5060369Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:11:01.5065776Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:11:01.5067880Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:11:01.5071826Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:11:01.5077554Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:11:01.5080220Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:11:01.5083060Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:11:01.5087009Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:11:01.5089328Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:11:01.5091372Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:11:01.5093960Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:11:01.5097901Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:11:01.5101001Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:11:01.5105037Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:11:01.5108078Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:11:01.5111151Z INFO:wheel:adding 
'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:11:01.5114513Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:11:01.5117710Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:11:01.5120660Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:11:01.5122875Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:11:01.5125452Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:11:01.5127164Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:11:01.5129285Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:11:01.5131947Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:11:01.5137003Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:11:01.5139384Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:11:01.5142071Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:11:01.5144042Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:11:01.5145683Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:11:01.5148955Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:11:01.5151842Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:11:01.5154405Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:11:01.5156280Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:11:01.5158163Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:11:01.5159956Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:11:01.5161616Z INFO:wheel:adding 
'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:11:01.5163090Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:11:01.5169365Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:11:01.5196603Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:11:01.5198585Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:11:01.5200994Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:11:01.5202641Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:11:01.5205341Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:11:01.5207295Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:11:01.5209085Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:11:01.5210904Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:11:01.5213555Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:11:01.5219484Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:11:01.5221512Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:11:01.5223172Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:11:01.5231194Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:11:01.5235918Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:11:01.5237920Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:11:01.5246233Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:11:01.5248589Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:11:01.5250946Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:11:01.5252664Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:11:01.5254982Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:11:01.5257722Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:11:01.5258689Z 
INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:11:01.5259632Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:11:01.5266677Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:11:01.5270547Z INFO:root:removing _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:11:01.6814839Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:11:01.6816382Z │ │ Version │ 2025-05-07T20:11:01.6817936Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:11:01.6819404Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:11:01.6820966Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:01.6822504Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:11:01.6824447Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:01.6825962Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:11:01.6827282Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:11:01.6827754Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:11:01.6828220Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:11:01.6828662Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:11:01.6829181Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:11:01.9263064Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:01.9997423Z 2025-05-07T20:11:02.0165519Z ################################################################################ 2025-05-07T20:11:02.0166069Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0166519Z [CHECK] Listing out library size: 2025-05-07T20:11:02.0166927Z + du -h --block-size=1M 
./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0167270Z 2025-05-07T20:11:02.0186713Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0188746Z 2025-05-07T20:11:02.0189315Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0190187Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.0190745Z 2025-05-07T20:11:02.0261515Z GLIBC_2.2.5 2025-05-07T20:11:02.0262167Z GLIBC_2.14 2025-05-07T20:11:02.0264466Z 2025-05-07T20:11:02.0264472Z 2025-05-07T20:11:02.0264861Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0265872Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.0266425Z 2025-05-07T20:11:02.0327484Z 2025-05-07T20:11:02.0327636Z 2025-05-07T20:11:02.0351204Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.fdfkooTMN1.symbols.txt 2025-05-07T20:11:02.0351873Z 2025-05-07T20:11:02.0380853Z 2025-05-07T20:11:02.0406232Z [CHECK] Total Number of symbols: 803 2025-05-07T20:11:02.0422816Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:11:02.0439439Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.aANL47LIeU.usymbols.txt 2025-05-07T20:11:02.0442402Z 2025-05-07T20:11:02.0454616Z 2025-05-07T20:11:02.0479782Z [CHECK] Listing out undefined symbols (49 total): 2025-05-07T20:11:02.0496658Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.0497709Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.0498683Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.0499635Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:02.0500602Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.0501523Z U __popcountdi2@GCC_3.4 
2025-05-07T20:11:02.0502049Z U abort@GLIBC_2.2.5 2025-05-07T20:11:02.0502354Z U close@GLIBC_2.2.5 2025-05-07T20:11:02.0502652Z U fputs@GLIBC_2.2.5 2025-05-07T20:11:02.0502979Z U free@GLIBC_2.2.5 2025-05-07T20:11:02.0503414Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:11:02.0503756Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:02.0504055Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:02.0504501Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:11:02.0504803Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:02.0505117Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:02.0505430Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:02.0505715Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.0506221Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.0506532Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.0506848Z U mmap@GLIBC_2.2.5 2025-05-07T20:11:02.0507190Z U mprotect@GLIBC_2.2.5 2025-05-07T20:11:02.0507519Z U munmap@GLIBC_2.2.5 2025-05-07T20:11:02.0507811Z U open64@GLIBC_2.2.5 2025-05-07T20:11:02.0508182Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.0508564Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:11:02.0508927Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.0509289Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.0509600Z U read@GLIBC_2.2.5 2025-05-07T20:11:02.0509910Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:02.0510200Z U shm_open 2025-05-07T20:11:02.0510478Z U shm_unlink 2025-05-07T20:11:02.0510752Z U snprintf@GLIBC_2.2.5 2025-05-07T20:11:02.0511066Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:02.0511351Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:02.0511665Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.0511976Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:02.0512251Z U syscall@GLIBC_2.2.5 2025-05-07T20:11:02.0512609Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:02.0512886Z U uname@GLIBC_2.2.5 2025-05-07T20:11:02.0513171Z U unlink@GLIBC_2.2.5 2025-05-07T20:11:02.0513445Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:11:02.0513828Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.0514237Z U vtable for 
__cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.0514669Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.0515056Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.0515364Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.0515679Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.0515965Z w __gmon_start__ 2025-05-07T20:11:02.0516607Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.0517019Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0517383Z 2025-05-07T20:11:02.0551097Z linux-vdso.so.1 (0x00007fff08b5a000) 2025-05-07T20:11:02.0551474Z libtorch_cpu.so => not found 2025-05-07T20:11:02.0551765Z libtorch_cuda.so => not found 2025-05-07T20:11:02.0552282Z libtorch.so => not found 2025-05-07T20:11:02.0552620Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f12c38b5000) 2025-05-07T20:11:02.0553073Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f12c3887000) 2025-05-07T20:11:02.0553464Z libc.so.6 => /lib64/libc.so.6 (0x00007f12c367d000) 2025-05-07T20:11:02.0553845Z libm.so.6 => /lib64/libm.so.6 (0x00007f12c35a2000) 2025-05-07T20:11:02.0554240Z /lib64/ld-linux-x86-64.so.2 (0x00007f12c3b98000) 2025-05-07T20:11:02.0558784Z 2025-05-07T20:11:02.0558926Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.0559353Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:11:02.0559640Z 2025-05-07T20:11:02.0595451Z 2025-05-07T20:11:02.0595800Z Dynamic section at offset 0x78e78 contains 33 entries: 2025-05-07T20:11:02.0596369Z Tag Type Name/Value 2025-05-07T20:11:02.0597013Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.0597590Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.0598129Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.0598651Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.0599191Z 0x0000000000000001 (NEEDED) 
Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.0599896Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.0600418Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:11:02.0600871Z 0x000000000000000c (INIT) 0x1a000 2025-05-07T20:11:02.0601278Z 0x000000000000000d (FINI) 0x5af2c 2025-05-07T20:11:02.0601645Z 0x0000000000000019 (INIT_ARRAY) 0x780a0 2025-05-07T20:11:02.0602016Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.0602370Z 0x000000000000001a (FINI_ARRAY) 0x780a8 2025-05-07T20:11:02.0602739Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.0603086Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.0603447Z 0x000000006ffffef5 (GNU_HASH) 0x1e18 2025-05-07T20:11:02.0603792Z 0x0000000000000005 (STRTAB) 0x86e0 2025-05-07T20:11:02.0604147Z 0x0000000000000006 (SYMTAB) 0x3b80 2025-05-07T20:11:02.0604505Z 0x000000000000000a (STRSZ) 45342 (bytes) 2025-05-07T20:11:02.0604899Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.0605277Z 0x0000000000000003 (PLTGOT) 0x790d8 2025-05-07T20:11:02.0605638Z 0x0000000000000002 (PLTRELSZ) 8064 (bytes) 2025-05-07T20:11:02.0606020Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.0606357Z 0x0000000000000017 (JMPREL) 0x17220 2025-05-07T20:11:02.0606775Z 0x0000000000000007 (RELA) 0x13ed8 2025-05-07T20:11:02.0607221Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:11:02.0607620Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.0608032Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.0608377Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.0608749Z 0x000000006ffffffe (VERNEED) 0x13e48 2025-05-07T20:11:02.0609341Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:02.0609738Z 0x000000006ffffff0 (VERSYM) 0x137fe 2025-05-07T20:11:02.0610051Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:11:02.0610570Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.0610778Z 2025-05-07T20:11:02.0610898Z 
################################################################################ 2025-05-07T20:11:02.0611140Z 2025-05-07T20:11:02.0611144Z 2025-05-07T20:11:02.0611258Z ################################################################################ 2025-05-07T20:11:02.0611748Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0612209Z [CHECK] Listing out library size: 2025-05-07T20:11:02.0612699Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0613043Z 2025-05-07T20:11:02.0613257Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0613536Z 2025-05-07T20:11:02.0613896Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0614829Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.0615379Z 2025-05-07T20:11:02.0658807Z GLIBC_2.2.5 2025-05-07T20:11:02.0659906Z GLIBC_2.14 2025-05-07T20:11:02.0660823Z 2025-05-07T20:11:02.0660861Z 2025-05-07T20:11:02.0662362Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0665113Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.0665721Z 2025-05-07T20:11:02.0714833Z GLIBCXX_3.4 2025-05-07T20:11:02.0715507Z GLIBCXX_3.4.9 2025-05-07T20:11:02.0716380Z GLIBCXX_3.4.21 2025-05-07T20:11:02.0716517Z 2025-05-07T20:11:02.0716539Z 2025-05-07T20:11:02.0738895Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.rMhi8YKRom.symbols.txt 2025-05-07T20:11:02.0740594Z 2025-05-07T20:11:02.0754510Z 2025-05-07T20:11:02.0783022Z [CHECK] Total Number of symbols: 107 
2025-05-07T20:11:02.0801641Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:02.0824114Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.kFMjrLc3Bi.usymbols.txt 2025-05-07T20:11:02.0824620Z 2025-05-07T20:11:02.0842039Z 2025-05-07T20:11:02.0877360Z [CHECK] Listing out undefined symbols (57 total): 2025-05-07T20:11:02.0895765Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.0896428Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.0896769Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.0897091Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.0897434Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.0897751Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.0898095Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.0898411Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.0898735Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:02.0899063Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.0899401Z U c10::BoolType::get() 2025-05-07T20:11:02.0900674Z U c10::StringType::get() 2025-05-07T20:11:02.0901020Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.0901783Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.0903011Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.0903840Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:02.0904155Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:02.0904448Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.0904767Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.0905065Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.0905423Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.0905814Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.0906238Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:02.0906992Z U 
std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:02.0907884Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.0908787Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.0909158Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.0909559Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.0909962Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.0910340Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.0910939Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.0911935Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.0912693Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.0913064Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.0913409Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.0913812Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.0914184Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.0914558Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.0914900Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.0915193Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:02.0915541Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.0916619Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.0917877Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:02.0918927Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.0919593Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:11:02.0920045Z U vtable for 
__cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.0920511Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.0921002Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.0921661Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.0922463Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.0923148Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.0923745Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.0924223Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.0924598Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.0924940Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.0925295Z w __gmon_start__ 2025-05-07T20:11:02.0925632Z w __pthread_key_create 2025-05-07T20:11:02.0926002Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.0926492Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0926805Z 2025-05-07T20:11:02.0943241Z linux-vdso.so.1 (0x00007ffc445ac000) 2025-05-07T20:11:02.0944300Z libc10.so => not found 2025-05-07T20:11:02.0944632Z libtorch_cpu.so => not found 2025-05-07T20:11:02.0944934Z libtorch_cuda.so => not found 2025-05-07T20:11:02.0945259Z libtorch.so => not found 2025-05-07T20:11:02.0945666Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fab1095a000) 2025-05-07T20:11:02.0946156Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fab1092a000) 2025-05-07T20:11:02.0946864Z libc.so.6 => /lib64/libc.so.6 (0x00007fab10722000) 2025-05-07T20:11:02.0947275Z libm.so.6 => /lib64/libm.so.6 (0x00007fab10647000) 2025-05-07T20:11:02.0947673Z /lib64/ld-linux-x86-64.so.2 (0x00007fab10bce000) 2025-05-07T20:11:02.0948049Z 2025-05-07T20:11:02.0948175Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.0948645Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:02.0948979Z 
2025-05-07T20:11:02.0985725Z 2025-05-07T20:11:02.0986465Z Dynamic section at offset 0xab00 contains 34 entries: 2025-05-07T20:11:02.0987673Z Tag Type Name/Value 2025-05-07T20:11:02.0988984Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.0990479Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.0991902Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.0992575Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.0993130Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.0993712Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.0994252Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.0994820Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:11:02.0995279Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:11:02.0995644Z 0x000000000000000d (FINI) 0x817c 2025-05-07T20:11:02.0995983Z 0x0000000000000019 (INIT_ARRAY) 0xaa58 2025-05-07T20:11:02.0996495Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:11:02.0996859Z 0x000000000000001a (FINI_ARRAY) 0xaa68 2025-05-07T20:11:02.0997240Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.0997624Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.0997967Z 0x000000006ffffef5 (GNU_HASH) 0x700 2025-05-07T20:11:02.0998341Z 0x0000000000000005 (STRTAB) 0x13b0 2025-05-07T20:11:02.0998686Z 0x0000000000000006 (SYMTAB) 0x990 2025-05-07T20:11:02.0999073Z 0x000000000000000a (STRSZ) 6890 (bytes) 2025-05-07T20:11:02.0999448Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.0999902Z 0x0000000000000003 (PLTGOT) 0xad70 2025-05-07T20:11:02.1000268Z 0x0000000000000002 (PLTRELSZ) 1272 (bytes) 2025-05-07T20:11:02.1000657Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.1001029Z 0x0000000000000017 (JMPREL) 0x34a8 2025-05-07T20:11:02.1001374Z 0x0000000000000007 (RELA) 0x3028 
2025-05-07T20:11:02.1001761Z 0x0000000000000008 (RELASZ) 1152 (bytes) 2025-05-07T20:11:02.1002129Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.1002499Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.1002846Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.1003235Z 0x000000006ffffffe (VERNEED) 0x2f78 2025-05-07T20:11:02.1003585Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:02.1003953Z 0x000000006ffffff0 (VERSYM) 0x2e9a 2025-05-07T20:11:02.1004319Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:11:02.1004645Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.1004896Z 2025-05-07T20:11:02.1005020Z ################################################################################ 2025-05-07T20:11:02.1005366Z 2025-05-07T20:11:02.1005371Z 2025-05-07T20:11:02.1005496Z ################################################################################ 2025-05-07T20:11:02.1005973Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1006439Z [CHECK] Listing out library size: 2025-05-07T20:11:02.1006849Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1007199Z 2025-05-07T20:11:02.1009346Z 6 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1009599Z 2025-05-07T20:11:02.1010043Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1010989Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.1011554Z 2025-05-07T20:11:02.1272779Z GLIBC_2.2.5 2025-05-07T20:11:02.1273450Z GLIBC_2.3 2025-05-07T20:11:02.1274070Z GLIBC_2.14 2025-05-07T20:11:02.1275777Z 2025-05-07T20:11:02.1275790Z 2025-05-07T20:11:02.1276660Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1277596Z + objdump -TC 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.1278181Z 2025-05-07T20:11:02.1537634Z GLIBCXX_3.4 2025-05-07T20:11:02.1537988Z GLIBCXX_3.4.9 2025-05-07T20:11:02.1538644Z GLIBCXX_3.4.11 2025-05-07T20:11:02.1538946Z GLIBCXX_3.4.14 2025-05-07T20:11:02.1539265Z GLIBCXX_3.4.15 2025-05-07T20:11:02.1539813Z GLIBCXX_3.4.18 2025-05-07T20:11:02.1540046Z GLIBCXX_3.4.21 2025-05-07T20:11:02.1540615Z 2025-05-07T20:11:02.1540620Z 2025-05-07T20:11:02.1561371Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.cw1rGFBFSk.symbols.txt 2025-05-07T20:11:02.1562599Z 2025-05-07T20:11:02.1778886Z 2025-05-07T20:11:02.1803353Z [CHECK] Total Number of symbols: 4871 2025-05-07T20:11:02.1823966Z [CHECK] Number of fbgemm symbols: 3365 2025-05-07T20:11:02.1839683Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.d52R02GAoc.usymbols.txt 2025-05-07T20:11:02.1840953Z 2025-05-07T20:11:02.1867411Z 2025-05-07T20:11:02.1893500Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:11:02.1914537Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.1915441Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.1916126Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.1916529Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.1916884Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.1917256Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:02.1917806Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.1918185Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.1918559Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.1918937Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:11:02.1919326Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.1919659Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:02.1920024Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:02.1920385Z U __cxa_throw_bad_array_new_length@CXXABI_1.3.8 2025-05-07T20:11:02.1920800Z 
U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.1921152Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:02.1921510Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.1921861Z U abort@GLIBC_2.2.5 2025-05-07T20:11:02.1922300Z U asmjit::_abi_1_13::BaseAssembler::bind(asmjit::_abi_1_13::Label const&) 2025-05-07T20:11:02.1922930Z U asmjit::_abi_1_13::BaseAssembler::newLabel() 2025-05-07T20:11:02.1923426Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:02.1924257Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:02.1925200Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:02.1926317Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:02.1927421Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:11:02.1928212Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:02.1928787Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:02.1929418Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:11:02.1930060Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:11:02.1930556Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:11:02.1931164Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:11:02.1931932Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature 
const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:11:02.1932534Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:11:02.1933001Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:11:02.1933604Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:11:02.1934247Z U asmjit::_abi_1_13::JitRuntime::_add(void**, asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:02.1934731Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:11:02.1935188Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:02.1935686Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:11:02.1936036Z U cpuinfo_get_packages 2025-05-07T20:11:02.1936370Z U cpuinfo_get_packages_count 2025-05-07T20:11:02.1936677Z U cpuinfo_initialize 2025-05-07T20:11:02.1936994Z U cpuinfo_isa 2025-05-07T20:11:02.1937320Z U fma@GLIBC_2.2.5 2025-05-07T20:11:02.1937595Z U fmaf@GLIBC_2.2.5 2025-05-07T20:11:02.1937905Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:02.1938182Z U free@GLIBC_2.2.5 2025-05-07T20:11:02.1938496Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:02.1938782Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:02.1939095Z U log2@GLIBC_2.2.5 2025-05-07T20:11:02.1939376Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:02.1939687Z U lrintf@GLIBC_2.2.5 2025-05-07T20:11:02.1939984Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:02.1940308Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.1940629Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.1940917Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.1941235Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:11:02.1941548Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:11:02.1941925Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.1942302Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:11:02.1942677Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.1943064Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.1943439Z U 
posix_memalign@GLIBC_2.2.5 2025-05-07T20:11:02.1943774Z U pow@GLIBC_2.2.5 2025-05-07T20:11:02.1944056Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:02.1944476Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:02.1944967Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:02.1945437Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:02.1946097Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:02.1947233Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:11:02.1948322Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:11:02.1949538Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.1950360Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.1950928Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:02.1951394Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:02.1951914Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:11:02.1952471Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:02.1952964Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:02.1953429Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:02.1953789Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:02.1954176Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.1954564Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:02.1954925Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:02.1955330Z U 
std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:02.1955734Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:11:02.1956259Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.1956674Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.1957154Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.1957571Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.1958405Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.1959237Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:11:02.1959557Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:02.1959969Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:02.1960405Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:11:02.1960793Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:11:02.1961414Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.1961787Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.1962467Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:02.1963271Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:11:02.1963811Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:02.1964349Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.1964906Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.1965420Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:02.1965821Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.1966194Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:02.1966684Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:02.1967224Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:02.1967701Z U 
std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:02.1968108Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.1968437Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:02.1968768Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:02.1969074Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.1969403Z U strstr@GLIBC_2.2.5 2025-05-07T20:11:02.1969709Z U tolower@GLIBC_2.2.5 2025-05-07T20:11:02.1970081Z U toupper@GLIBC_2.2.5 2025-05-07T20:11:02.1970482Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:11:02.1971013Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:02.1971441Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:02.1971853Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:02.1972298Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.1972750Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.1973200Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:02.1973618Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:02.1974113Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.1974584Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.1974906Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.1975241Z w __gmon_start__ 2025-05-07T20:11:02.1975525Z w __pthread_key_create 2025-05-07T20:11:02.1975877Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.1976214Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.1976565Z w pthread_once 2025-05-07T20:11:02.1976910Z w pthread_rwlock_rdlock 2025-05-07T20:11:02.1977227Z w pthread_rwlock_unlock 2025-05-07T20:11:02.1977566Z w pthread_rwlock_wrlock 2025-05-07T20:11:02.1977883Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:11:02.1978273Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.1978679Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1978961Z 2025-05-07T20:11:02.1979103Z linux-vdso.so.1 (0x00007ffc610ec000) 2025-05-07T20:11:02.1979432Z libc10.so 
=> not found 2025-05-07T20:11:02.1979931Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f1dbbc1c000) 2025-05-07T20:11:02.1980504Z libtorch.so => not found 2025-05-07T20:11:02.1980760Z libtorch_cpu.so => not found 2025-05-07T20:11:02.1981068Z libtorch_cuda.so => not found 2025-05-07T20:11:02.1981402Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1dbb39c000) 2025-05-07T20:11:02.1981814Z libm.so.6 => /lib64/libm.so.6 (0x00007f1dbbb3f000) 2025-05-07T20:11:02.1982189Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f1dbb36e000) 2025-05-07T20:11:02.1982589Z libc.so.6 => /lib64/libc.so.6 (0x00007f1dbb166000) 2025-05-07T20:11:02.1983037Z /lib64/ld-linux-x86-64.so.2 (0x00007f1dbbc9b000) 2025-05-07T20:11:02.1983366Z libtorch_cpu.so => not found 2025-05-07T20:11:02.1983668Z libtorch_cuda.so => not found 2025-05-07T20:11:02.1983937Z libtorch.so => not found 2025-05-07T20:11:02.1984128Z 2025-05-07T20:11:02.1984243Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.1984616Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:11:02.1984922Z 2025-05-07T20:11:02.1998202Z 2025-05-07T20:11:02.1998708Z Dynamic section at offset 0x51fb38 contains 38 entries: 2025-05-07T20:11:02.1999855Z Tag Type Name/Value 2025-05-07T20:11:02.2001060Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.2002576Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:11:02.2004054Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.2005547Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.2006670Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.2007206Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.2007747Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:02.2008312Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 
2025-05-07T20:11:02.2008858Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.2009452Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.2009992Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:11:02.2010525Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.2010948Z 0x000000000000000c (INIT) 0xf6000 2025-05-07T20:11:02.2011330Z 0x000000000000000d (FINI) 0x4c8fb0 2025-05-07T20:11:02.2011692Z 0x0000000000000019 (INIT_ARRAY) 0x51dac0 2025-05-07T20:11:02.2012093Z 0x000000000000001b (INIT_ARRAYSZ) 56 (bytes) 2025-05-07T20:11:02.2012464Z 0x000000000000001a (FINI_ARRAY) 0x51daf8 2025-05-07T20:11:02.2012856Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.2013235Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:02.2013583Z 0x000000006ffffef5 (GNU_HASH) 0x6e20 2025-05-07T20:11:02.2013958Z 0x0000000000000005 (STRTAB) 0x2b0a0 2025-05-07T20:11:02.2014310Z 0x0000000000000006 (SYMTAB) 0xe7e0 2025-05-07T20:11:02.2014710Z 0x000000000000000a (STRSZ) 708057 (bytes) 2025-05-07T20:11:02.2015094Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.2015520Z 0x0000000000000003 (PLTGOT) 0x520dd8 2025-05-07T20:11:02.2015919Z 0x0000000000000002 (PLTRELSZ) 24312 (bytes) 2025-05-07T20:11:02.2016283Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.2016650Z 0x0000000000000017 (JMPREL) 0xef8e0 2025-05-07T20:11:02.2016997Z 0x0000000000000007 (RELA) 0xda610 2025-05-07T20:11:02.2017394Z 0x0000000000000008 (RELASZ) 86736 (bytes) 2025-05-07T20:11:02.2017776Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.2018144Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.2018489Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.2018880Z 0x000000006ffffffe (VERNEED) 0xda490 2025-05-07T20:11:02.2019250Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.2019596Z 0x000000006ffffff0 (VERSYM) 0xd7e7a 2025-05-07T20:11:02.2019963Z 0x000000006ffffff9 
(RELACOUNT) 9 2025-05-07T20:11:02.2020288Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.2020525Z 2025-05-07T20:11:02.2020649Z ################################################################################ 2025-05-07T20:11:02.2020917Z 2025-05-07T20:11:02.2020922Z 2025-05-07T20:11:02.2021047Z ################################################################################ 2025-05-07T20:11:02.2021571Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2022098Z [CHECK] Listing out library size: 2025-05-07T20:11:02.2022565Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2022968Z 2025-05-07T20:11:02.2023187Z 3 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2023495Z 2025-05-07T20:11:02.2023888Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2024905Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2025497Z 2025-05-07T20:11:02.2075215Z GLIBC_2.2.5 2025-05-07T20:11:02.2075878Z GLIBC_2.3 2025-05-07T20:11:02.2076728Z GLIBC_2.14 2025-05-07T20:11:02.2077064Z 2025-05-07T20:11:02.2077078Z 2025-05-07T20:11:02.2078282Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2081607Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2083406Z 2025-05-07T20:11:02.2136813Z GLIBCXX_3.4 2025-05-07T20:11:02.2137638Z GLIBCXX_3.4.9 2025-05-07T20:11:02.2137998Z GLIBCXX_3.4.14 2025-05-07T20:11:02.2138437Z GLIBCXX_3.4.20 2025-05-07T20:11:02.2138736Z GLIBCXX_3.4.21 2025-05-07T20:11:02.2138987Z 
GLIBCXX_3.4.29 2025-05-07T20:11:02.2139125Z 2025-05-07T20:11:02.2139135Z 2025-05-07T20:11:02.2155840Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.kqK5P6BLcO.symbols.txt 2025-05-07T20:11:02.2156687Z 2025-05-07T20:11:02.2183350Z 2025-05-07T20:11:02.2216309Z [CHECK] Total Number of symbols: 505 2025-05-07T20:11:02.2232332Z [CHECK] Number of fbgemm symbols: 47 2025-05-07T20:11:02.2249763Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.noe9ZAmVTU.usymbols.txt 2025-05-07T20:11:02.2250253Z 2025-05-07T20:11:02.2277552Z 2025-05-07T20:11:02.2310829Z [CHECK] Listing out undefined symbols (195 total): 2025-05-07T20:11:02.2333525Z U GOMP_barrier 2025-05-07T20:11:02.2334002Z U GOMP_parallel 2025-05-07T20:11:02.2334609Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2335399Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.2335796Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.2336378Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.2336813Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.2337217Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.2337643Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.2338020Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.2338429Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.2338808Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.2339183Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.2339545Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.2339885Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.2340256Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:02.2340599Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.2340972Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.2341315Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.2341677Z U __cxa_rethrow@CXXABI_1.3 
2025-05-07T20:11:02.2342087Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:02.2342457Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.2342825Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.2343337Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:02.2343965Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:02.2344447Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:02.2345413Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2346350Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:11:02.2347099Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:02.2347721Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:02.2348420Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:02.2349629Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:02.2350569Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:02.2351411Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2352248Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:02.2352630Z U at::get_num_threads() 2025-05-07T20:11:02.2352949Z U at::get_thread_num() 2025-05-07T20:11:02.2353298Z U at::in_parallel_region() 2025-05-07T20:11:02.2353618Z U at::init_num_threads() 2025-05-07T20:11:02.2353976Z U at::internal::set_thread_num(int) 2025-05-07T20:11:02.2354376Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:02.2354740Z U c10::BoolType::get() 2025-05-07T20:11:02.2355137Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.2355803Z U 
c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.2356571Z U c10::Error::what() const 2025-05-07T20:11:02.2356979Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2357436Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2357927Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.2358372Z U c10::IntType::get() 2025-05-07T20:11:02.2358761Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.2359208Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.2359707Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.2360194Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:02.2360602Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:02.2360996Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:02.2361442Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.2362138Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.2362865Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.2363294Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:02.2363682Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.2364081Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:02.2364481Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:02.2364821Z U c10::SymIntType::get() 2025-05-07T20:11:02.2365219Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:02.2365601Z U c10::TensorType::get() 2025-05-07T20:11:02.2365963Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.2366949Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.2367929Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.2368496Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 
2025-05-07T20:11:02.2369022Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:02.2369757Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:02.2370333Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.2370711Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.2371074Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.2371410Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:02.2371776Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.2372259Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.2372713Z U c10::cuda::device_count() 2025-05-07T20:11:02.2373083Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.2373457Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.2373869Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.2374262Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.2374682Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.2375084Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.2375788Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.2376874Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.2377771Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.2378709Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.2379756Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.2380783Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.2381151Z U 
c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.2381536Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.2381967Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:02.2382366Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:02.2382798Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.2397095Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:02.2397545Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:02.2397988Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:02.2398373Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:02.2398786Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.2399256Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.2399723Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.2400147Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.2400540Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.2400940Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.2401308Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.2401701Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.2402098Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.2402474Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.2402867Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.2403227Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.2403712Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:02.2404087Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.2404537Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.2405592Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2407263Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, 
bool, bool, bool, bool) 2025-05-07T20:11:02.2409058Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2410630Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2412253Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2414376Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2416183Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2418005Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2419827Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2421658Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2423490Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2425329Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.2426739Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 
2025-05-07T20:11:02.2427168Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:02.2427857Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:02.2428382Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2429016Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2429497Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2430063Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2430568Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.2431016Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2431455Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2431839Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.2432177Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.2432519Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.2432831Z U omp_get_max_threads 2025-05-07T20:11:02.2433162Z U omp_get_num_threads 2025-05-07T20:11:02.2433464Z U omp_get_thread_num 2025-05-07T20:11:02.2433841Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.2434245Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.2434875Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.2435765Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.2436757Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.2437461Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:02.2437909Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:02.2438305Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.2438723Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:02.2439134Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.2439571Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 
2025-05-07T20:11:02.2440003Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.2440574Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.2441313Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.2442366Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2443635Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.2444415Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:02.2444797Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:02.2445199Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.2445569Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.2445963Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.2446349Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.2447003Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.2447385Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.2447813Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2448395Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2448898Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.2449558Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:11:02.2452147Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:02.2453313Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:02.2454196Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:02.2454682Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.2455020Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.2455885Z U torch::Library::Library(torch::Library::Kind, 
std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.2457088Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.2457931Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.2458950Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.2459560Z U typeinfo for c10::Error 2025-05-07T20:11:02.2459921Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:02.2460363Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2460803Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.2461242Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.2461663Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.2462051Z U vtable for c10::Error 2025-05-07T20:11:02.2462605Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2463351Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2463988Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.2464531Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.2465007Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.2465350Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.2465661Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.2465986Z w __gmon_start__ 2025-05-07T20:11:02.2466259Z w __pthread_key_create 2025-05-07T20:11:02.2466623Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.2467059Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2467386Z 2025-05-07T20:11:02.2467540Z linux-vdso.so.1 (0x00007ffe1c1af000) 2025-05-07T20:11:02.2467856Z libc10.so => not found 2025-05-07T20:11:02.2468105Z libc10_cuda.so => not found 
2025-05-07T20:11:02.2468642Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f1a12000000) 2025-05-07T20:11:02.2469532Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f1a12afa000) 2025-05-07T20:11:02.2470145Z libtorch.so => not found 2025-05-07T20:11:02.2470428Z libtorch_cpu.so => not found 2025-05-07T20:11:02.2470697Z libtorch_cuda.so => not found 2025-05-07T20:11:02.2470994Z libcudart.so.12 => not found 2025-05-07T20:11:02.2471320Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1a11d9c000) 2025-05-07T20:11:02.2471783Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f1a12aca000) 2025-05-07T20:11:02.2472156Z libc.so.6 => /lib64/libc.so.6 (0x00007f1a11b94000) 2025-05-07T20:11:02.2472568Z /lib64/ld-linux-x86-64.so.2 (0x00007f1a12b0a000) 2025-05-07T20:11:02.2472925Z libc10.so => not found 2025-05-07T20:11:02.2473421Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f1a12a4d000) 2025-05-07T20:11:02.2473993Z libtorch.so => not found 2025-05-07T20:11:02.2474253Z libtorch_cpu.so => not found 2025-05-07T20:11:02.2474551Z libtorch_cuda.so => not found 2025-05-07T20:11:02.2474849Z libm.so.6 => /lib64/libm.so.6 (0x00007f1a12970000) 2025-05-07T20:11:02.2475375Z libc10.so => not found 2025-05-07T20:11:02.2475635Z libtorch_cpu.so => not found 2025-05-07T20:11:02.2475946Z libtorch_cuda.so => not found 2025-05-07T20:11:02.2476361Z libtorch.so => not found 2025-05-07T20:11:02.2476634Z libtorch_cpu.so => not found 2025-05-07T20:11:02.2477133Z libtorch_cuda.so => not found 2025-05-07T20:11:02.2477421Z libtorch.so => not found 2025-05-07T20:11:02.2477620Z 2025-05-07T20:11:02.2477744Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.2478197Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:02.2478570Z 2025-05-07T20:11:02.2478575Z 
2025-05-07T20:11:02.2478738Z Dynamic section at offset 0x2c4138 contains 40 entries: 2025-05-07T20:11:02.2479212Z Tag Type Name/Value 2025-05-07T20:11:02.2479643Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.2480198Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.2480722Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:02.2481281Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:02.2481822Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.2482382Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.2482943Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.2483481Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:02.2484049Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.2484578Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.2485119Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.2485676Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.2486272Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:02.2486834Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.2487252Z 0x000000000000000c (INIT) 0x13000 2025-05-07T20:11:02.2487626Z 0x000000000000000d (FINI) 0x7422c 2025-05-07T20:11:02.2487979Z 0x0000000000000019 (INIT_ARRAY) 0x2c4cf8 2025-05-07T20:11:02.2488363Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:11:02.2488720Z 0x000000000000001a (FINI_ARRAY) 0x2c4d40 2025-05-07T20:11:02.2489112Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.2489593Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:02.2489915Z 0x000000006ffffef5 (GNU_HASH) 0x18b0 2025-05-07T20:11:02.2490262Z 0x0000000000000005 (STRTAB) 0x5790 
2025-05-07T20:11:02.2490573Z 0x0000000000000006 (SYMTAB) 0x2820 2025-05-07T20:11:02.2490917Z 0x000000000000000a (STRSZ) 40152 (bytes) 2025-05-07T20:11:02.2491264Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.2491606Z 0x0000000000000003 (PLTGOT) 0x2c53f8 2025-05-07T20:11:02.2491950Z 0x0000000000000002 (PLTRELSZ) 6768 (bytes) 2025-05-07T20:11:02.2492336Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.2492674Z 0x0000000000000017 (JMPREL) 0x10f38 2025-05-07T20:11:02.2492986Z 0x0000000000000007 (RELA) 0xf990 2025-05-07T20:11:02.2493380Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:11:02.2493721Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.2494050Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.2494366Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.2494711Z 0x000000006ffffffe (VERNEED) 0xf860 2025-05-07T20:11:02.2495032Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.2495377Z 0x000000006ffffff0 (VERSYM) 0xf468 2025-05-07T20:11:02.2495710Z 0x000000006ffffff9 (RELACOUNT) 17 2025-05-07T20:11:02.2496003Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.2496197Z 2025-05-07T20:11:02.2496340Z ################################################################################ 2025-05-07T20:11:02.2496560Z 2025-05-07T20:11:02.2496564Z 2025-05-07T20:11:02.2496680Z ################################################################################ 2025-05-07T20:11:02.2497164Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2497639Z [CHECK] Listing out library size: 2025-05-07T20:11:02.2498088Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2498429Z 2025-05-07T20:11:02.2498635Z 21 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2498919Z 2025-05-07T20:11:02.2499275Z [CHECK] Listing out the GLIBC versions referenced by: 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2500178Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2500718Z 2025-05-07T20:11:02.2504705Z GLIBC_2.2.5 2025-05-07T20:11:02.2505283Z GLIBC_2.14 2025-05-07T20:11:02.2505641Z 2025-05-07T20:11:02.2505654Z 2025-05-07T20:11:02.2506800Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2509697Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2511492Z 2025-05-07T20:11:02.2583669Z GLIBCXX_3.4 2025-05-07T20:11:02.2584333Z GLIBCXX_3.4.9 2025-05-07T20:11:02.2585268Z GLIBCXX_3.4.11 2025-05-07T20:11:02.2585843Z GLIBCXX_3.4.20 2025-05-07T20:11:02.2586063Z GLIBCXX_3.4.21 2025-05-07T20:11:02.2586300Z GLIBCXX_3.4.29 2025-05-07T20:11:02.2586430Z 2025-05-07T20:11:02.2586435Z 2025-05-07T20:11:02.2601648Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.i4c6I7aaFR.symbols.txt 2025-05-07T20:11:02.2603001Z 2025-05-07T20:11:02.2644650Z 2025-05-07T20:11:02.2671194Z [CHECK] Total Number of symbols: 811 2025-05-07T20:11:02.2693378Z [CHECK] Number of fbgemm symbols: 80 2025-05-07T20:11:02.2713024Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.PVMNozJ7nH.usymbols.txt 2025-05-07T20:11:02.2713542Z 2025-05-07T20:11:02.2744227Z 2025-05-07T20:11:02.2778657Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:11:02.2798232Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2798817Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.2799224Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.2799648Z U 
__cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.2800040Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.2800452Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.2801001Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.2801396Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.2801833Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.2802346Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.2802682Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.2803043Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.2803370Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.2803726Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.2804068Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.2804429Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.2804764Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.2805163Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:02.2805611Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:02.2806373Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2807553Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2808966Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2810008Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:02.2810940Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2811929Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:02.2812760Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:02.2813794Z U 
at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2814915Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.2815784Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:02.2816182Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:02.2816689Z U c10::BoolType::get() 2025-05-07T20:11:02.2817046Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.2817453Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:02.2817841Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2818274Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.2818638Z U c10::IntType::get() 2025-05-07T20:11:02.2819044Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.2819539Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.2819928Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.2820570Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.2821225Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.2821581Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.2821974Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.2822355Z U c10::TensorType::get() 2025-05-07T20:11:02.2822699Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.2823614Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.2824521Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.2824905Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.2825240Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.2825597Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.2825956Z U c10::cuda::MaybeSetDevice(signed char) 
2025-05-07T20:11:02.2826289Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.2826770Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.2827221Z U c10::cuda::current_device() 2025-05-07T20:11:02.2827558Z U c10::cuda::device_count() 2025-05-07T20:11:02.2827951Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.2828366Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.2828769Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.2829150Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.2829568Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.2829946Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.2830665Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.2831494Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.2832289Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.2833169Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.2833732Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.2834041Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.2834396Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:02.2834793Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:02.2835184Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.2835544Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.2835907Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:02.2836349Z U c10::throwNullDataPtrError() 2025-05-07T20:11:02.2836838Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.2837179Z U c10::warnDeprecatedDataPtr() 
2025-05-07T20:11:02.2837588Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.2838027Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.2838394Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:02.2838758Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.2839141Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.2839493Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.2839885Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.2840227Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.2840598Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.2840957Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.2841311Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:02.2841668Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:02.2842001Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:02.2842356Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.2842687Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.2843034Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.2843367Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:02.2843721Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:02.2844231Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:02.2844824Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:02.2845173Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:02.2845502Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:02.2845850Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.2846224Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.2846867Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2847344Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2847718Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2848075Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:02.2848437Z U long c10::detail::maybe_wrap_dim_slow(long, long, 
bool) 2025-05-07T20:11:02.2848873Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.2849286Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2849652Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.2849977Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.2850267Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.2850636Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.2851026Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.2851631Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.2852572Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.2853537Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.2854136Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.2854507Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.2854929Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.2855365Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.2855778Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.2856272Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.2856960Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.2857992Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2858838Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.2859192Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.2859596Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.2859967Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.2860313Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.2860673Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.2861073Z U std::ostream& 
std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2861629Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.2862094Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.2862433Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.2862780Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.2863597Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.2864744Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.2865590Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.2866299Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.2866988Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.2867456Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.2867885Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.2868316Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.2868917Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2869689Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.2870328Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.2870853Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.2871331Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.2871650Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.2871959Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.2872247Z w __gmon_start__ 2025-05-07T20:11:02.2872528Z w __pthread_key_create 2025-05-07T20:11:02.2872830Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.2873166Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.2873533Z [CHECK] 
Listing out external shared libraries linked: 2025-05-07T20:11:02.2873972Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2874274Z 2025-05-07T20:11:02.2874415Z linux-vdso.so.1 (0x00007ffcc208f000) 2025-05-07T20:11:02.2874705Z libtorch.so => not found 2025-05-07T20:11:02.2874969Z libc10.so => not found 2025-05-07T20:11:02.2875210Z libc10_cuda.so => not found 2025-05-07T20:11:02.2875486Z libtorch_cpu.so => not found 2025-05-07T20:11:02.2875752Z libtorch_cuda.so => not found 2025-05-07T20:11:02.2876083Z libcudart.so.12 => not found 2025-05-07T20:11:02.2876432Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f449e79c000) 2025-05-07T20:11:02.2876994Z libm.so.6 => /lib64/libm.so.6 (0x00007f44a011f000) 2025-05-07T20:11:02.2877448Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f44a00f1000) 2025-05-07T20:11:02.2879328Z libc.so.6 => /lib64/libc.so.6 (0x00007f449e594000) 2025-05-07T20:11:02.2879706Z /lib64/ld-linux-x86-64.so.2 (0x00007f44a0200000) 2025-05-07T20:11:02.2879942Z 2025-05-07T20:11:02.2880117Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.2880547Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:02.2880902Z 2025-05-07T20:11:02.2880924Z 2025-05-07T20:11:02.2881091Z Dynamic section at offset 0x14c3b48 contains 37 entries: 2025-05-07T20:11:02.2881492Z Tag Type Name/Value 2025-05-07T20:11:02.2881918Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.2882435Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.2882937Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.2883474Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.2883997Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.2884536Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:02.2885084Z 0x0000000000000001 (NEEDED) Shared library: 
[libstdc++.so.6] 2025-05-07T20:11:02.2885583Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:02.2886136Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.2886634Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.2887195Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:02.2887638Z 0x000000000000000c (INIT) 0x2a000 2025-05-07T20:11:02.2887991Z 0x000000000000000d (FINI) 0xe445c 2025-05-07T20:11:02.2888359Z 0x0000000000000019 (INIT_ARRAY) 0x14c31b0 2025-05-07T20:11:02.2888714Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:11:02.2889077Z 0x000000000000001a (FINI_ARRAY) 0x14c3280 2025-05-07T20:11:02.2889418Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.2889752Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.2890082Z 0x000000006ffffef5 (GNU_HASH) 0x1eb8 2025-05-07T20:11:02.2890405Z 0x0000000000000005 (STRTAB) 0x86a8 2025-05-07T20:11:02.2890717Z 0x0000000000000006 (SYMTAB) 0x3a88 2025-05-07T20:11:02.2891088Z 0x000000000000000a (STRSZ) 113475 (bytes) 2025-05-07T20:11:02.2891456Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.2891826Z 0x0000000000000003 (PLTGOT) 0x14c3de8 2025-05-07T20:11:02.2892188Z 0x0000000000000002 (PLTRELSZ) 8736 (bytes) 2025-05-07T20:11:02.2892532Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.2892867Z 0x0000000000000017 (JMPREL) 0x27c08 2025-05-07T20:11:02.2893187Z 0x0000000000000007 (RELA) 0x24968 2025-05-07T20:11:02.2893553Z 0x0000000000000008 (RELASZ) 12960 (bytes) 2025-05-07T20:11:02.2893928Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.2894248Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.2894601Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.2894951Z 0x000000006ffffffe (VERNEED) 0x24848 2025-05-07T20:11:02.2895304Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.2895622Z 0x000000006ffffff0 (VERSYM) 0x241ec 
2025-05-07T20:11:02.2895966Z 0x000000006ffffff9 (RELACOUNT) 39 2025-05-07T20:11:02.2896274Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.2896502Z 2025-05-07T20:11:02.2896617Z ################################################################################ 2025-05-07T20:11:02.2896840Z 2025-05-07T20:11:02.2896844Z 2025-05-07T20:11:02.2896977Z ################################################################################ 2025-05-07T20:11:02.2897497Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.2898003Z [CHECK] Listing out library size: 2025-05-07T20:11:02.2898518Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.2899015Z 2025-05-07T20:11:02.2899227Z 9 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.2899537Z 2025-05-07T20:11:02.2899941Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.2900930Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2901549Z 2025-05-07T20:11:02.2957913Z GLIBC_2.2.5 2025-05-07T20:11:02.2958567Z GLIBC_2.3 2025-05-07T20:11:02.2959110Z GLIBC_2.14 2025-05-07T20:11:02.2961539Z 2025-05-07T20:11:02.2961555Z 2025-05-07T20:11:02.2962835Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.2965853Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.2966474Z 2025-05-07T20:11:02.3027950Z GLIBCXX_3.4 2025-05-07T20:11:02.3028939Z GLIBCXX_3.4.9 2025-05-07T20:11:02.3029536Z GLIBCXX_3.4.11 2025-05-07T20:11:02.3030144Z GLIBCXX_3.4.18 
2025-05-07T20:11:02.3030718Z GLIBCXX_3.4.21 2025-05-07T20:11:02.3031313Z GLIBCXX_3.4.29 2025-05-07T20:11:02.3031686Z 2025-05-07T20:11:02.3031699Z 2025-05-07T20:11:02.3056624Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.L9so5OSCo1.symbols.txt 2025-05-07T20:11:02.3057097Z 2025-05-07T20:11:02.3084268Z 2025-05-07T20:11:02.3112700Z [CHECK] Total Number of symbols: 342 2025-05-07T20:11:02.3131381Z [CHECK] Number of fbgemm symbols: 14 2025-05-07T20:11:02.3149132Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.8kSKV0bOmt.usymbols.txt 2025-05-07T20:11:02.3149641Z 2025-05-07T20:11:02.3172806Z 2025-05-07T20:11:02.3202414Z [CHECK] Listing out undefined symbols (129 total): 2025-05-07T20:11:02.3225974Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3228421Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3229987Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.3231003Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.3231392Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.3231786Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.3232163Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.3232545Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.3232902Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.3233261Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.3233627Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.3234043Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.3234369Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.3234674Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.3234992Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:02.3235313Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.3235643Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.3235980Z U 
at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:02.3236500Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:02.3237220Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:02.3237735Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:02.3238121Z U c10::BoolType::get() 2025-05-07T20:11:02.3238519Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.3238919Z U c10::FloatType::get() 2025-05-07T20:11:02.3239251Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:02.3239644Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3240089Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.3240440Z U c10::IntType::get() 2025-05-07T20:11:02.3240816Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.3241230Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.3241603Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.3242018Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:02.3242682Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.3243347Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.3243755Z U c10::TensorType::get() 2025-05-07T20:11:02.3244085Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.3245040Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.3246002Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.3246390Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.3246968Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.3247321Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.3247687Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:02.3248042Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.3248531Z U 
c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.3249010Z U c10::cuda::device_count() 2025-05-07T20:11:02.3249358Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.3249804Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.3250202Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.3250610Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.3251017Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.3251426Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.3252212Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.3253156Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.3253994Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.3254909Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.3255453Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.3255816Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.3256147Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.3256564Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.3256966Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:02.3257308Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.3257753Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.3258172Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.3258552Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.3258899Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.3259263Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.3259619Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.3259947Z U 
cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.3260306Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.3260665Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:02.3261052Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.3261384Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.3261747Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.3262078Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:02.3262439Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.3262845Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.3263183Z U float at::Tensor::item() const 2025-05-07T20:11:02.3263578Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3263978Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3264392Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3264740Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.3265057Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.3265366Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.3265699Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.3266137Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.3266854Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.3267868Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.3268740Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.3269622Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.3270249Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.3270607Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:02.3271008Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.3271444Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.3271843Z U 
std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.3272360Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.3273072Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.3274140Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3275397Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.3276259Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.3276636Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.3277060Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.3277431Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.3277814Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.3278161Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.3278600Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3279178Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3279667Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.3280056Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.3280383Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.3280738Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.3281598Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.3282775Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.3283660Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.3284430Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.3285070Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.3285532Z U vtable for 
__cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.3285991Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.3286647Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3287473Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3288376Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3289054Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.3289568Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.3290023Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.3290370Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.3290677Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.3290990Z w __gmon_start__ 2025-05-07T20:11:02.3291263Z w __pthread_key_create 2025-05-07T20:11:02.3291590Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.3291912Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.3292310Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.3292796Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.3293102Z 2025-05-07T20:11:02.3293208Z linux-vdso.so.1 (0x00007ffd08b89000) 2025-05-07T20:11:02.3293525Z libtorch.so => not found 2025-05-07T20:11:02.3293764Z libc10.so => not found 2025-05-07T20:11:02.3294030Z libc10_cuda.so => not found 2025-05-07T20:11:02.3294291Z libtorch_cpu.so => not found 2025-05-07T20:11:02.3294571Z libtorch_cuda.so => not found 2025-05-07T20:11:02.3294855Z libcudart.so.12 => not found 2025-05-07T20:11:02.3295176Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fecf8d9c000) 2025-05-07T20:11:02.3295617Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fecf9ad0000) 2025-05-07T20:11:02.3295984Z libc.so.6 => /lib64/libc.so.6 (0x00007fecf8b94000) 2025-05-07T20:11:02.3296361Z /lib64/ld-linux-x86-64.so.2 (0x00007fecf9b04000) 2025-05-07T20:11:02.3296706Z libm.so.6 => 
/lib64/libm.so.6 (0x00007fecf99f5000) 2025-05-07T20:11:02.3296939Z 2025-05-07T20:11:02.3297048Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.3297470Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:02.3297824Z 2025-05-07T20:11:02.3307139Z 2025-05-07T20:11:02.3307327Z Dynamic section at offset 0x8a8558 contains 37 entries: 2025-05-07T20:11:02.3307777Z Tag Type Name/Value 2025-05-07T20:11:02.3308307Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.3308866Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.3309402Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.3309924Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.3310472Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.3310993Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:02.3311574Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.3312093Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.3312610Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.3313174Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.3313734Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:02.3314229Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:02.3314562Z 0x000000000000000d (FINI) 0x3464c 2025-05-07T20:11:02.3314926Z 0x0000000000000019 (INIT_ARRAY) 0x8a82d8 2025-05-07T20:11:02.3315294Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:11:02.3315649Z 0x000000000000001a (FINI_ARRAY) 0x8a8308 2025-05-07T20:11:02.3316104Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.3316446Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:02.3316795Z 0x000000006ffffef5 (GNU_HASH) 0xf58 
2025-05-07T20:11:02.3317129Z 0x0000000000000005 (STRTAB) 0x3a30 2025-05-07T20:11:02.3317513Z 0x0000000000000006 (SYMTAB) 0x1a08 2025-05-07T20:11:02.3317865Z 0x000000000000000a (STRSZ) 36563 (bytes) 2025-05-07T20:11:02.3318258Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.3318615Z 0x0000000000000003 (PLTGOT) 0x8a87f8 2025-05-07T20:11:02.3318981Z 0x0000000000000002 (PLTRELSZ) 3600 (bytes) 2025-05-07T20:11:02.3319345Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.3319672Z 0x0000000000000017 (JMPREL) 0xe920 2025-05-07T20:11:02.3320027Z 0x0000000000000007 (RELA) 0xcce8 2025-05-07T20:11:02.3320379Z 0x0000000000000008 (RELASZ) 7224 (bytes) 2025-05-07T20:11:02.3320753Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.3321099Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.3321438Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.3321800Z 0x000000006ffffffe (VERNEED) 0xcbb8 2025-05-07T20:11:02.3322143Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.3322492Z 0x000000006ffffff0 (VERSYM) 0xc904 2025-05-07T20:11:02.3322821Z 0x000000006ffffff9 (RELACOUNT) 90 2025-05-07T20:11:02.3323154Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.3323359Z 2025-05-07T20:11:02.3323475Z ################################################################################ 2025-05-07T20:11:02.3323760Z 2025-05-07T20:11:02.3323764Z 2025-05-07T20:11:02.3323880Z ################################################################################ 2025-05-07T20:11:02.3324411Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3324891Z [CHECK] Listing out library size: 2025-05-07T20:11:02.3325359Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3325721Z 2025-05-07T20:11:02.3325940Z 17 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3326239Z 2025-05-07T20:11:02.3326615Z [CHECK] Listing out the GLIBC 
versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3327588Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.3328157Z 2025-05-07T20:11:02.3389843Z GLIBC_2.2.5 2025-05-07T20:11:02.3390205Z GLIBC_2.14 2025-05-07T20:11:02.3390421Z 2025-05-07T20:11:02.3390426Z 2025-05-07T20:11:02.3390860Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3391858Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.3392624Z 2025-05-07T20:11:02.3447257Z GLIBCXX_3.4 2025-05-07T20:11:02.3447527Z GLIBCXX_3.4.9 2025-05-07T20:11:02.3447768Z GLIBCXX_3.4.20 2025-05-07T20:11:02.3448012Z GLIBCXX_3.4.21 2025-05-07T20:11:02.3448223Z GLIBCXX_3.4.29 2025-05-07T20:11:02.3449493Z 2025-05-07T20:11:02.3449525Z 2025-05-07T20:11:02.3471830Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.Yn7miyJt7c.symbols.txt 2025-05-07T20:11:02.3472352Z 2025-05-07T20:11:02.3502201Z 2025-05-07T20:11:02.3534344Z [CHECK] Total Number of symbols: 469 2025-05-07T20:11:02.3548141Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:11:02.3563384Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.WGaxGalynQ.usymbols.txt 2025-05-07T20:11:02.3563865Z 2025-05-07T20:11:02.3592214Z 2025-05-07T20:11:02.3621251Z [CHECK] Listing out undefined symbols (155 total): 2025-05-07T20:11:02.3638185Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3638861Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.3639239Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.3639808Z U __cudaPushCallConfiguration@libcudart.so.12 
2025-05-07T20:11:02.3640215Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.3640593Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.3641002Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.3641364Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.3641750Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.3642126Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.3642484Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.3642805Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.3643154Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.3643490Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.3643810Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.3644163Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.3644481Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.3644813Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:02.3645134Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.3645677Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:02.3646227Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:02.3647365Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.3648732Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.3649686Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:02.3650146Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:02.3650641Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:02.3651149Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:02.3651622Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:02.3652347Z U at::_ops::zeros::call(c10::ArrayRef, 
std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.3653547Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.3654357Z U c10::BoolType::get() 2025-05-07T20:11:02.3654745Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.3655172Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.3655559Z U c10::IntType::get() 2025-05-07T20:11:02.3655937Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.3656353Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.3656814Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.3657318Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.3657757Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.3658431Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.3659207Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.3659679Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.3660025Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:02.3660378Z U c10::SymIntType::get() 2025-05-07T20:11:02.3660747Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:02.3661191Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.3661561Z U c10::TensorType::get() 2025-05-07T20:11:02.3661898Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.3662962Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.3663891Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.3664269Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.3664620Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.3664977Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.3665335Z U 
c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:02.3665672Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.3666195Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.3666665Z U c10::cuda::current_device() 2025-05-07T20:11:02.3667025Z U c10::cuda::device_count() 2025-05-07T20:11:02.3667390Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.3667767Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.3668147Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.3668539Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.3668956Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.3669336Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.3670077Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.3670953Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.3671872Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.3672794Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.3673772Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.3674532Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.3674866Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.3675244Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:02.3675653Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:02.3676123Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.3676650Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.3677076Z U c10::operator<<(std::ostream&, 
c10::DeviceType) 2025-05-07T20:11:02.3677513Z U c10::throwNullDataPtrError() 2025-05-07T20:11:02.3677866Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.3678226Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:02.3678690Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.3679147Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.3679505Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:02.3679897Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.3680278Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.3680666Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.3681050Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.3681405Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.3681771Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.3682184Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.3682548Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:02.3682925Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:02.3683273Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:02.3683668Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:02.3684042Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.3684411Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.3684753Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.3685172Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:02.3685534Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:02.3686077Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:02.3686623Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:02.3686971Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:02.3687333Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:02.3687691Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.3688076Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.3688464Z U int* at::TensorBase::data_ptr() const 
2025-05-07T20:11:02.3688848Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3689236Z U log2@GLIBC_2.2.5 2025-05-07T20:11:02.3689612Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.3690064Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.3690465Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.3690841Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.3691154Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.3691477Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.3691827Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.3692210Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.3692802Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.3693657Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.3694500Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.3695177Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.3695530Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.3696013Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.3696414Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.3696896Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.3697577Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.3698531Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3699276Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:02.3699621Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.3699941Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.3700276Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.3700591Z U std::ios_base::~ios_base()@GLIBCXX_3.4 
2025-05-07T20:11:02.3700920Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.3701238Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.3701609Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3702113Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.3702554Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.3702880Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.3703210Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.3703498Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.3704290Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.3705350Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.3706120Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.3706804Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.3707353Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:02.3707716Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.3708111Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.3708503Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.3709078Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3709825Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.3710421Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.3710926Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.3711334Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.3711641Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.3711926Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.3712207Z w 
__gmon_start__ 2025-05-07T20:11:02.3712622Z w __pthread_key_create 2025-05-07T20:11:02.3712960Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.3713574Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3713881Z 2025-05-07T20:11:02.3713994Z linux-vdso.so.1 (0x00007ffeabbfa000) 2025-05-07T20:11:02.3714302Z libtorch.so => not found 2025-05-07T20:11:02.3714545Z libc10.so => not found 2025-05-07T20:11:02.3714794Z libc10_cuda.so => not found 2025-05-07T20:11:02.3715139Z libtorch_cpu.so => not found 2025-05-07T20:11:02.3715446Z libtorch_cuda.so => not found 2025-05-07T20:11:02.3715708Z libcudart.so.12 => not found 2025-05-07T20:11:02.3716119Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9f36f9c000) 2025-05-07T20:11:02.3716524Z libm.so.6 => /lib64/libm.so.6 (0x00007f9f3838f000) 2025-05-07T20:11:02.3716897Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9f38361000) 2025-05-07T20:11:02.3717285Z libc.so.6 => /lib64/libc.so.6 (0x00007f9f36d94000) 2025-05-07T20:11:02.3717641Z /lib64/ld-linux-x86-64.so.2 (0x00007f9f38470000) 2025-05-07T20:11:02.3717911Z 2025-05-07T20:11:02.3718021Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.3718458Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:02.3718791Z 2025-05-07T20:11:02.3718797Z 2025-05-07T20:11:02.3718959Z Dynamic section at offset 0x106d2d0 contains 37 entries: 2025-05-07T20:11:02.3719344Z Tag Type Name/Value 2025-05-07T20:11:02.3719764Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.3720269Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.3720762Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.3721277Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.3721856Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.3722381Z 0x0000000000000001 (NEEDED) 
Shared library: [libcudart.so.12] 2025-05-07T20:11:02.3722930Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.3723438Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:02.3723963Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.3724465Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.3724991Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:02.3725449Z 0x000000000000000c (INIT) 0x12000 2025-05-07T20:11:02.3725777Z 0x000000000000000d (FINI) 0xa2d3c 2025-05-07T20:11:02.3726123Z 0x0000000000000019 (INIT_ARRAY) 0x106de30 2025-05-07T20:11:02.3726470Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:11:02.3726827Z 0x000000000000001a (FINI_ARRAY) 0x106de90 2025-05-07T20:11:02.3727169Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.3727531Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.3727868Z 0x000000006ffffef5 (GNU_HASH) 0x1640 2025-05-07T20:11:02.3728190Z 0x0000000000000005 (STRTAB) 0x51f0 2025-05-07T20:11:02.3728575Z 0x0000000000000006 (SYMTAB) 0x25e0 2025-05-07T20:11:02.3728931Z 0x000000000000000a (STRSZ) 38760 (bytes) 2025-05-07T20:11:02.3729317Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.3729677Z 0x0000000000000003 (PLTGOT) 0x106e570 2025-05-07T20:11:02.3730068Z 0x0000000000000002 (PLTRELSZ) 5376 (bytes) 2025-05-07T20:11:02.3730426Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.3730781Z 0x0000000000000017 (JMPREL) 0x10600 2025-05-07T20:11:02.3731141Z 0x0000000000000007 (RELA) 0xee18 2025-05-07T20:11:02.3731495Z 0x0000000000000008 (RELASZ) 6120 (bytes) 2025-05-07T20:11:02.3731882Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.3732218Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.3732584Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.3732947Z 0x000000006ffffffe (VERNEED) 0xed08 2025-05-07T20:11:02.3733323Z 0x000000006fffffff 
(VERNEEDNUM) 5 2025-05-07T20:11:02.3733687Z 0x000000006ffffff0 (VERSYM) 0xe958 2025-05-07T20:11:02.3734024Z 0x000000006ffffff9 (RELACOUNT) 26 2025-05-07T20:11:02.3734390Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.3734606Z 2025-05-07T20:11:02.3734729Z ################################################################################ 2025-05-07T20:11:02.3734985Z 2025-05-07T20:11:02.3734990Z 2025-05-07T20:11:02.3735122Z ################################################################################ 2025-05-07T20:11:02.3735679Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.3736207Z [CHECK] Listing out library size: 2025-05-07T20:11:02.3736719Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.3737124Z 2025-05-07T20:11:02.3737352Z 2 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.3737711Z 2025-05-07T20:11:02.3738133Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.3739197Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.3739827Z 2025-05-07T20:11:02.3791548Z GLIBC_2.2.5 2025-05-07T20:11:02.3791917Z GLIBC_2.3 2025-05-07T20:11:02.3792145Z GLIBC_2.14 2025-05-07T20:11:02.3793925Z 2025-05-07T20:11:02.3793941Z 2025-05-07T20:11:02.3794587Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.3795773Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.3796527Z 2025-05-07T20:11:02.3848380Z GLIBCXX_3.4 
2025-05-07T20:11:02.3848649Z GLIBCXX_3.4.9 2025-05-07T20:11:02.3848915Z GLIBCXX_3.4.21 2025-05-07T20:11:02.3849142Z GLIBCXX_3.4.29 2025-05-07T20:11:02.3849279Z 2025-05-07T20:11:02.3849307Z 2025-05-07T20:11:02.3870132Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.lScfYo3bjY.symbols.txt 2025-05-07T20:11:02.3870654Z 2025-05-07T20:11:02.3889458Z 2025-05-07T20:11:02.3918205Z [CHECK] Total Number of symbols: 326 2025-05-07T20:11:02.3938048Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:02.3952732Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.ycVtcNyS6l.usymbols.txt 2025-05-07T20:11:02.3953256Z 2025-05-07T20:11:02.3976003Z 2025-05-07T20:11:02.3999294Z [CHECK] Listing out undefined symbols (143 total): 2025-05-07T20:11:02.4019495Z U GOMP_parallel 2025-05-07T20:11:02.4020476Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.4036931Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.4037472Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.4037917Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.4038313Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.4038718Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.4039100Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.4039496Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.4039896Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.4040249Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.4040589Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.4040914Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.4041260Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.4041586Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.4041941Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.4042268Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.4042723Z U __tls_get_addr@GLIBC_2.3 
2025-05-07T20:11:02.4043152Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:02.4044046Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.4045400Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.4046357Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:02.4047083Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:02.4047817Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.4048528Z U at::get_num_threads() 2025-05-07T20:11:02.4048842Z U at::get_thread_num() 2025-05-07T20:11:02.4049172Z U at::in_parallel_region() 2025-05-07T20:11:02.4049475Z U at::init_num_threads() 2025-05-07T20:11:02.4049816Z U at::internal::set_thread_num(int) 2025-05-07T20:11:02.4050728Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.4051719Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:02.4052311Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.4052765Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.4053209Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.4053638Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.4054023Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:02.4054404Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:02.4054804Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.4055238Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.4055657Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:02.4056098Z U 
c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.4056490Z U c10::TensorType::get() 2025-05-07T20:11:02.4056870Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.4057845Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.4058827Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.4059490Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.4059864Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.4060250Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.4060618Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:02.4060973Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.4061469Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.4061971Z U c10::cuda::device_count() 2025-05-07T20:11:02.4062321Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.4062731Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.4062921Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.4063073Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.4063252Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.4063374Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.4063904Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.4064179Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.4064683Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.4065031Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.4065175Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.4065294Z U 
c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.4065456Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:02.4065674Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:02.4065803Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.4065958Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.4066142Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:02.4066266Z U c10::throwNullDataPtrError() 2025-05-07T20:11:02.4066384Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.4066513Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:02.4066725Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.4066847Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.4066983Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:02.4067135Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.4067276Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.4067396Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.4067544Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.4067665Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.4067787Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.4067931Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.4068088Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:02.4068199Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:02.4068324Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:02.4068462Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.4068577Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.4068688Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.4068989Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:02.4069117Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:02.4069235Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:02.4069376Z U cudaStreamQuery@libcudart.so.12 
2025-05-07T20:11:02.4069511Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.4069637Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.4069790Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.4069952Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.4070163Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.4070304Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.4070428Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.4070530Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.4070634Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.4070754Z U omp_get_num_threads 2025-05-07T20:11:02.4070854Z U omp_get_thread_num 2025-05-07T20:11:02.4071008Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.4071142Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.4071511Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.4071907Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.4072277Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.4072432Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:02.4072581Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:02.4072728Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.4072904Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.4073051Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.4073324Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.4073703Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.4074289Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.4074440Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:02.4074571Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.4074704Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.4074850Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.4074979Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.4075102Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.4075218Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.4075424Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.4075682Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:02.4075784Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.4075929Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.4076622Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.4077117Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.4077381Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.4077753Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.4077941Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.4078112Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.4078311Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.4078692Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.4079029Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.4079235Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.4079489Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.4079612Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.4079729Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.4079863Z w 
__cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.4079959Z w __gmon_start__ 2025-05-07T20:11:02.4080062Z w __pthread_key_create 2025-05-07T20:11:02.4080218Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.4080480Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.4080488Z 2025-05-07T20:11:02.4080626Z linux-vdso.so.1 (0x00007ffd8d3e3000) 2025-05-07T20:11:02.4080720Z libc10.so => not found 2025-05-07T20:11:02.4080849Z libc10_cuda.so => not found 2025-05-07T20:11:02.4080975Z libtorch.so => not found 2025-05-07T20:11:02.4081079Z libtorch_cpu.so => not found 2025-05-07T20:11:02.4081207Z libtorch_cuda.so => not found 2025-05-07T20:11:02.4081305Z libcudart.so.12 => not found 2025-05-07T20:11:02.4081513Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa14cd61000) 2025-05-07T20:11:02.4081672Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa14cd33000) 2025-05-07T20:11:02.4081827Z libc.so.6 => /lib64/libc.so.6 (0x00007fa14cb2b000) 2025-05-07T20:11:02.4081961Z /lib64/ld-linux-x86-64.so.2 (0x00007fa14d17b000) 2025-05-07T20:11:02.4082091Z libm.so.6 => /lib64/libm.so.6 (0x00007fa14ca50000) 2025-05-07T20:11:02.4082096Z 2025-05-07T20:11:02.4082233Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.4082499Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:02.4082504Z 2025-05-07T20:11:02.4096764Z 2025-05-07T20:11:02.4097204Z Dynamic section at offset 0x179670 contains 38 entries: 2025-05-07T20:11:02.4097429Z Tag Type Name/Value 2025-05-07T20:11:02.4097653Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.4097955Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.4098209Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.4098524Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.4098756Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cuda.so] 2025-05-07T20:11:02.4098968Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:02.4099176Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.4099396Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.4099591Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.4099813Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.4100093Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:02.4100284Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.4100406Z 0x000000000000000c (INIT) 0xc000 2025-05-07T20:11:02.4100530Z 0x000000000000000d (FINI) 0x237dc 2025-05-07T20:11:02.4100671Z 0x0000000000000019 (INIT_ARRAY) 0x1792c0 2025-05-07T20:11:02.4100801Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:11:02.4100962Z 0x000000000000001a (FINI_ARRAY) 0x1792e0 2025-05-07T20:11:02.4101110Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.4101224Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:02.4101347Z 0x000000006ffffef5 (GNU_HASH) 0x10f8 2025-05-07T20:11:02.4101483Z 0x0000000000000005 (STRTAB) 0x38a8 2025-05-07T20:11:02.4101596Z 0x0000000000000006 (SYMTAB) 0x1a00 2025-05-07T20:11:02.4101749Z 0x000000000000000a (STRSZ) 24404 (bytes) 2025-05-07T20:11:02.4101879Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.4102028Z 0x0000000000000003 (PLTGOT) 0x179910 2025-05-07T20:11:02.4102165Z 0x0000000000000002 (PLTRELSZ) 3864 (bytes) 2025-05-07T20:11:02.4102278Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.4102416Z 0x0000000000000017 (JMPREL) 0xaba8 2025-05-07T20:11:02.4102525Z 0x0000000000000007 (RELA) 0x9ba0 2025-05-07T20:11:02.4102661Z 0x0000000000000008 (RELASZ) 4104 (bytes) 2025-05-07T20:11:02.4102805Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.4102910Z 0x0000000000000018 (BIND_NOW) 
2025-05-07T20:11:02.4103036Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.4103159Z 0x000000006ffffffe (VERNEED) 0x9a90 2025-05-07T20:11:02.4103329Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.4103452Z 0x000000006ffffff0 (VERSYM) 0x97fc 2025-05-07T20:11:02.4103564Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:11:02.4103723Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.4103729Z 2025-05-07T20:11:02.4103853Z ################################################################################ 2025-05-07T20:11:02.4103860Z 2025-05-07T20:11:02.4103864Z 2025-05-07T20:11:02.4103984Z ################################################################################ 2025-05-07T20:11:02.4104339Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.4104452Z [CHECK] Listing out library size: 2025-05-07T20:11:02.4104771Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.4104775Z 2025-05-07T20:11:02.4112903Z 8 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.4112917Z 2025-05-07T20:11:02.4113378Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.4113928Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.4113990Z 2025-05-07T20:11:02.4545938Z GLIBC_2.2.5 2025-05-07T20:11:02.4546142Z GLIBC_2.3 2025-05-07T20:11:02.4546271Z GLIBC_2.14 2025-05-07T20:11:02.4548423Z 2025-05-07T20:11:02.4548439Z 2025-05-07T20:11:02.4548951Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.4549529Z + objdump -TC 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.4549534Z 2025-05-07T20:11:02.4982092Z GLIBCXX_3.4 2025-05-07T20:11:02.4982217Z GLIBCXX_3.4.9 2025-05-07T20:11:02.4982320Z GLIBCXX_3.4.11 2025-05-07T20:11:02.4982439Z GLIBCXX_3.4.15 2025-05-07T20:11:02.4982644Z GLIBCXX_3.4.18 2025-05-07T20:11:02.4982786Z GLIBCXX_3.4.20 2025-05-07T20:11:02.4982883Z GLIBCXX_3.4.21 2025-05-07T20:11:02.4983003Z GLIBCXX_3.4.29 2025-05-07T20:11:02.4983014Z 2025-05-07T20:11:02.4983023Z 2025-05-07T20:11:02.5002210Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.xSoaHlQFJF.symbols.txt 2025-05-07T20:11:02.5002219Z 2025-05-07T20:11:02.5396760Z 2025-05-07T20:11:02.5429966Z [CHECK] Total Number of symbols: 4265 2025-05-07T20:11:02.5469150Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:02.5488071Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.Pr2z0NlZt2.usymbols.txt 2025-05-07T20:11:02.5488150Z 2025-05-07T20:11:02.5517962Z 2025-05-07T20:11:02.5545820Z [CHECK] Listing out undefined symbols (190 total): 2025-05-07T20:11:02.5566398Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.5566666Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.5567164Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.5567359Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.5567497Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.5567646Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.5567862Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:02.5567988Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.5568176Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.5568290Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.5568400Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.5568512Z U __cxa_throw@CXXABI_1.3 
2025-05-07T20:11:02.5569496Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.5569617Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.5569883Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:02.5570054Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:02.5570175Z U at::RecordFunction::end() 2025-05-07T20:11:02.5570316Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:02.5570500Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:02.5570821Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:02.5571127Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:02.5571480Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:02.5571702Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:02.5572353Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.5572608Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:02.5572784Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:02.5572922Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:02.5573105Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:02.5573247Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:02.5573358Z U c10::AnyType::get() 2025-05-07T20:11:02.5573490Z U c10::BoolType::get() 2025-05-07T20:11:02.5573663Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.5573854Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:02.5574005Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:02.5574525Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 
2025-05-07T20:11:02.5575163Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:02.5575611Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.5575729Z U c10::Error::what() const 2025-05-07T20:11:02.5575842Z U c10::FloatType::get() 2025-05-07T20:11:02.5575979Z U c10::GradMode::is_enabled() 2025-05-07T20:11:02.5576099Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:02.5576262Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:02.5576414Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:02.5576535Z U c10::IValue::isBoolList() const 2025-05-07T20:11:02.5576658Z U c10::IValue::isDoubleList() const 2025-05-07T20:11:02.5576800Z U c10::IValue::isIntList() const 2025-05-07T20:11:02.5576923Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:02.5577044Z U c10::IValue::isTensorList() const 2025-05-07T20:11:02.5577193Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.5577321Z U c10::IntType::get() 2025-05-07T20:11:02.5577831Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.5578015Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.5578192Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.5578328Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:02.5578460Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:02.5578708Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.5578997Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:02.5579107Z U c10::StringType::get() 2025-05-07T20:11:02.5579279Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:02.5579428Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.5579586Z U 
c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:02.5579773Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:02.5580276Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.5580415Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.5580588Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:02.5580723Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:02.5580844Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.5580994Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:02.5581104Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:02.5581206Z U c10::SymIntType::get() 2025-05-07T20:11:02.5581348Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:02.5581453Z U c10::TensorType::get() 2025-05-07T20:11:02.5581575Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.5582002Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.5582488Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.5582739Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.5583256Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.5583580Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.5584131Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.5584460Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:02.5584644Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:02.5584766Z U c10::impl::raw_local_dispatch_key_set 
2025-05-07T20:11:02.5584941Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:02.5585320Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.5585466Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:02.5585625Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:02.5585806Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:02.5585970Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:02.5586178Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.5586292Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.5586548Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:02.5586816Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:02.5586918Z U free@GLIBC_2.2.5 2025-05-07T20:11:02.5587096Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.5587201Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:02.5587295Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.5587417Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.5587515Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.5587662Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.5587781Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.5587900Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:02.5588149Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:02.5588473Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.5588853Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.5589172Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.5589504Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 
2025-05-07T20:11:02.5589866Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.5590212Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.5590347Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.5590462Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:02.5590631Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.5590795Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.5590959Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.5591092Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.5591253Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:02.5591483Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.5591810Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.5592368Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.5592846Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.5592976Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:02.5593114Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.5593259Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.5593379Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.5593539Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.5593654Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.5593767Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.5593966Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.5594193Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.5594323Z U std::ostream::operator<<(int)@GLIBCXX_3.4 
2025-05-07T20:11:02.5594507Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:02.5594641Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:02.5595048Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:02.5595201Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:02.5595481Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.5595587Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:02.5595734Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.5595875Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.5596718Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.5597209Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.5597475Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.5597614Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:02.5597939Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:02.5598130Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:02.5598346Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:02.5598565Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:02.5598952Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:02.5599112Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:02.5599332Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:02.5599521Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:02.5599651Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:02.5599785Z U 
torch::autograd::Node::metadata() 2025-05-07T20:11:02.5599932Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:02.5600188Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:02.5600490Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:02.5600639Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:02.5600870Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:02.5601092Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:02.5603828Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:02.5604006Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:02.5604156Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:02.5604317Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:02.5605068Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:02.5605220Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:02.5605628Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:02.5605995Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.5606105Z U typeinfo for c10::Error 2025-05-07T20:11:02.5606243Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:02.5606386Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:02.5606520Z U typeinfo for std::out_of_range@GLIBCXX_3.4 
2025-05-07T20:11:02.5606650Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:02.5606786Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:02.5606931Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.5607093Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.5607254Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:02.5607435Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.5607540Z U vtable for c10::Error 2025-05-07T20:11:02.5607888Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.5608201Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.5608336Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:02.5608541Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.5608755Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.5608871Z U vtable for torch::autograd::Node 2025-05-07T20:11:02.5609066Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:02.5609180Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.5609291Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.5609421Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.5609516Z w __gmon_start__ 2025-05-07T20:11:02.5609614Z w __pthread_key_create 2025-05-07T20:11:02.5609726Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.5609875Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.5610017Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.5610284Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.5610291Z 2025-05-07T20:11:02.5614428Z linux-vdso.so.1 (0x00007ffef6dd3000) 2025-05-07T20:11:02.5614703Z libc10.so => not found 2025-05-07T20:11:02.5615195Z fbgemm_gpu_tbe_common.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fa1bea00000) 2025-05-07T20:11:02.5615327Z libtorch.so => not found 2025-05-07T20:11:02.5615774Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fa1bf551000) 2025-05-07T20:11:02.5615886Z libtorch_cpu.so => not found 2025-05-07T20:11:02.5616016Z libtorch_cuda.so => not found 2025-05-07T20:11:02.5616189Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa1be79c000) 2025-05-07T20:11:02.5616353Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa1bf521000) 2025-05-07T20:11:02.5616488Z libc.so.6 => /lib64/libc.so.6 (0x00007fa1be594000) 2025-05-07T20:11:02.5616655Z /lib64/ld-linux-x86-64.so.2 (0x00007fa1bf561000) 2025-05-07T20:11:02.5616755Z libc10.so => not found 2025-05-07T20:11:02.5616858Z libc10_cuda.so => not found 2025-05-07T20:11:02.5617243Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fa1be000000) 2025-05-07T20:11:02.5617377Z libtorch.so => not found 2025-05-07T20:11:02.5617482Z libtorch_cpu.so => not found 2025-05-07T20:11:02.5617604Z libtorch_cuda.so => not found 2025-05-07T20:11:02.5617710Z libcudart.so.12 => not found 2025-05-07T20:11:02.5617805Z libc10.so => not found 2025-05-07T20:11:02.5617909Z libtorch_cpu.so => not found 2025-05-07T20:11:02.5618027Z libtorch_cuda.so => not found 2025-05-07T20:11:02.5618125Z libtorch.so => not found 2025-05-07T20:11:02.5618258Z libm.so.6 => /lib64/libm.so.6 (0x00007fa1bdf25000) 2025-05-07T20:11:02.5618355Z libc10.so => not found 2025-05-07T20:11:02.5618708Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fa1bed85000) 2025-05-07T20:11:02.5618803Z libtorch.so => not found 2025-05-07T20:11:02.5618895Z libtorch_cpu.so => not found 2025-05-07T20:11:02.5619001Z libtorch_cuda.so => not found 2025-05-07T20:11:02.5619097Z libtorch_cpu.so => not found 
2025-05-07T20:11:02.5619190Z libtorch_cuda.so => not found 2025-05-07T20:11:02.5619293Z libtorch.so => not found 2025-05-07T20:11:02.5619742Z 2025-05-07T20:11:02.5619893Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.5620183Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:02.5620189Z 2025-05-07T20:11:02.5665256Z 2025-05-07T20:11:02.5666227Z Dynamic section at offset 0x701230 contains 38 entries: 2025-05-07T20:11:02.5666630Z Tag Type Name/Value 2025-05-07T20:11:02.5667243Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.5667903Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:02.5668482Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.5669118Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:02.5669709Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.5670293Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.5670883Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.5671459Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.5672001Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.5672615Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.5673686Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:11:02.5674092Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.5674260Z 0x000000000000000c (INIT) 0x178000 2025-05-07T20:11:02.5674388Z 0x000000000000000d (FINI) 0x65b3d8 2025-05-07T20:11:02.5674501Z 0x0000000000000019 (INIT_ARRAY) 0x6fcd78 2025-05-07T20:11:02.5674633Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:11:02.5674750Z 0x000000000000001a (FINI_ARRAY) 0x6fce78 
2025-05-07T20:11:02.5674886Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.5674994Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.5675110Z 0x000000006ffffef5 (GNU_HASH) 0x6490 2025-05-07T20:11:02.5675233Z 0x0000000000000005 (STRTAB) 0x25438 2025-05-07T20:11:02.5675346Z 0x0000000000000006 (SYMTAB) 0xc448 2025-05-07T20:11:02.5675487Z 0x000000000000000a (STRSZ) 1180638 (bytes) 2025-05-07T20:11:02.5675621Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.5675736Z 0x0000000000000003 (PLTGOT) 0x7024d0 2025-05-07T20:11:02.5675872Z 0x0000000000000002 (PLTRELSZ) 20976 (bytes) 2025-05-07T20:11:02.5675979Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.5676240Z 0x0000000000000017 (JMPREL) 0x171f98 2025-05-07T20:11:02.5676350Z 0x0000000000000007 (RELA) 0x147aa0 2025-05-07T20:11:02.5676483Z 0x0000000000000008 (RELASZ) 173304 (bytes) 2025-05-07T20:11:02.5676616Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.5676711Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.5676834Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.5676966Z 0x000000006ffffffe (VERNEED) 0x147970 2025-05-07T20:11:02.5677125Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:02.5677242Z 0x000000006ffffff0 (VERSYM) 0x145816 2025-05-07T20:11:02.5677347Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:11:02.5677459Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.5677478Z 2025-05-07T20:11:02.5677594Z ################################################################################ 2025-05-07T20:11:02.5677601Z 2025-05-07T20:11:02.5677605Z 2025-05-07T20:11:02.5677715Z ################################################################################ 2025-05-07T20:11:02.5678003Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.5678148Z [CHECK] Listing out library size: 2025-05-07T20:11:02.5678416Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 
2025-05-07T20:11:02.5678421Z 2025-05-07T20:11:02.5683314Z 432 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.5684581Z 2025-05-07T20:11:02.5685209Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.5685709Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.5685714Z 2025-05-07T20:11:02.6074721Z GLIBC_2.2.5 2025-05-07T20:11:02.6074886Z GLIBC_2.3 2025-05-07T20:11:02.6074998Z GLIBC_2.14 2025-05-07T20:11:02.6075080Z 2025-05-07T20:11:02.6075086Z 2025-05-07T20:11:02.6075523Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.6076157Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.6076179Z 2025-05-07T20:11:02.6468834Z GLIBCXX_3.4 2025-05-07T20:11:02.6469246Z GLIBCXX_3.4.9 2025-05-07T20:11:02.6469515Z GLIBCXX_3.4.11 2025-05-07T20:11:02.6470046Z GLIBCXX_3.4.14 2025-05-07T20:11:02.6470289Z GLIBCXX_3.4.18 2025-05-07T20:11:02.6470519Z GLIBCXX_3.4.20 2025-05-07T20:11:02.6470763Z GLIBCXX_3.4.21 2025-05-07T20:11:02.6470989Z GLIBCXX_3.4.29 2025-05-07T20:11:02.6472711Z 2025-05-07T20:11:02.6472729Z 2025-05-07T20:11:02.6494310Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.csaP2Ygunp.symbols.txt 2025-05-07T20:11:02.6494355Z 2025-05-07T20:11:02.6839837Z 2025-05-07T20:11:02.6868018Z [CHECK] Total Number of symbols: 4997 2025-05-07T20:11:02.6894685Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:11:02.6912940Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.SXeNy7E9pb.usymbols.txt 2025-05-07T20:11:02.6912953Z 
2025-05-07T20:11:02.6945745Z 2025-05-07T20:11:02.6978162Z [CHECK] Listing out undefined symbols (258 total): 2025-05-07T20:11:02.6997756Z U GOMP_parallel 2025-05-07T20:11:02.6998963Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.6999396Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.6999514Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:02.6999671Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.7000007Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:02.7000155Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.7000306Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:02.7000473Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:02.7000598Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:02.7000745Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:02.7000871Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.7001005Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.7001120Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.7001241Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.7001377Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:02.7001493Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.7001610Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.7001726Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.7001857Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.7002042Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:02.7002163Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.7002286Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.7002488Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:02.7003050Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7003653Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, 
std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7004300Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7004495Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:02.7004987Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7005220Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:02.7005556Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:02.7006235Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:02.7006723Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7006864Z U at::detail::getCUDAHooks() 2025-05-07T20:11:02.7006975Z U at::detail::getHIPHooks() 2025-05-07T20:11:02.7007086Z U at::get_num_threads() 2025-05-07T20:11:02.7007223Z U at::get_thread_num() 2025-05-07T20:11:02.7007331Z U at::globalContext() 2025-05-07T20:11:02.7007444Z U at::in_parallel_region() 2025-05-07T20:11:02.7007574Z U at::init_num_threads() 2025-05-07T20:11:02.7007700Z U at::internal::set_thread_num(int) 2025-05-07T20:11:02.7007879Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:02.7008083Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7008323Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.7008467Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:11:02.7008873Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:11:02.7009045Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:02.7009709Z U 
c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7010112Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.7010229Z U c10::Error::what() const 2025-05-07T20:11:02.7010347Z U c10::GradMode::is_enabled() 2025-05-07T20:11:02.7010489Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:02.7010648Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7010855Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.7011036Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:02.7011167Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:11:02.7011284Z U c10::IValue::isTensorList() const 2025-05-07T20:11:02.7011444Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.7011546Z U c10::IntType::get() 2025-05-07T20:11:02.7012028Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7012214Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.7012340Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.7012440Z U c10::NoneType::get() 2025-05-07T20:11:02.7012785Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7012917Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:02.7013040Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:02.7013196Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:02.7013452Z U c10::StringType::get() 2025-05-07T20:11:02.7013589Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.7013989Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.7014133Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.7014246Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.7014388Z U 
c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:02.7014807Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:02.7014904Z U c10::TensorType::get() 2025-05-07T20:11:02.7015638Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:02.7015755Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.7016414Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:02.7016663Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:11:02.7016788Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:02.7016903Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:02.7017031Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:02.7017139Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:02.7017253Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:02.7017377Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:02.7017613Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:02.7017712Z U c10::cuda::device_count() 2025-05-07T20:11:02.7017841Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:02.7017981Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:02.7018117Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:02.7018245Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:02.7018435Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:02.7018544Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:02.7018945Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7019437Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.7020390Z U 
c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.7020644Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.7021104Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.7021443Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.7022021Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.7022175Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:11:02.7022296Z U c10::get_default_dtype() 2025-05-07T20:11:02.7022550Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:02.7022737Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:02.7022867Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:02.7022972Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:02.7023098Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:02.7023210Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:02.7023570Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.7023724Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:11:02.7023854Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:11:02.7024119Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:11:02.7024256Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:02.7024388Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:02.7024561Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 
2025-05-07T20:11:02.7024665Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:02.7024872Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.7024983Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.7025100Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:02.7025224Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:02.7025346Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:02.7025464Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:02.7025571Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:02.7025695Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:02.7025814Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:02.7025956Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:02.7026076Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:02.7026182Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:02.7026288Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:02.7026406Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:02.7026534Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:02.7027234Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7028007Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7028772Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7029520Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7030325Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, 
bool, bool, bool, bool) 2025-05-07T20:11:02.7031105Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7031767Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:02.7032501Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7033477Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7034337Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7035209Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7035973Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:02.7037081Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7037991Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7038748Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:02.7039564Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7040392Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7041278Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7042197Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7043033Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:02.7043925Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:02.7044840Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7045668Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7046799Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7049343Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7050193Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, 
bool) 2025-05-07T20:11:02.7051115Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7052035Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:02.7052252Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7052417Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.7052548Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7052711Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.7053126Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:11:02.7053303Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.7053451Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7053603Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:02.7054185Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:11:02.7054618Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7054757Z U memchr@GLIBC_2.2.5 2025-05-07T20:11:02.7054854Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.7054961Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.7055101Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.7055195Z U omp_get_num_threads 2025-05-07T20:11:02.7055301Z U omp_get_thread_num 2025-05-07T20:11:02.7055458Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.7055584Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.7055810Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 
2025-05-07T20:11:02.7056197Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.7056601Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.7057033Z U std::__cxx11::basic_stringbuf, std::allocator >::_M_sync(char*, unsigned long, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:11:02.7057373Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.7057712Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:02.7058123Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.7058501Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.7058656Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:02.7058798Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:02.7058917Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.7059049Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:02.7059190Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:02.7059334Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.7059488Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.7059662Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.7059824Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.7060077Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.7060537Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:02.7061192Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7061678Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 
2025-05-07T20:11:02.7061851Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:02.7061995Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:11:02.7062330Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:02.7062445Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:02.7062563Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:02.7062685Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.7062829Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.7062941Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.7063059Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.7063262Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7063366Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.7063484Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.7063862Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:02.7063989Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:11:02.7064172Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7064570Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7064698Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.7064872Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:02.7065011Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:02.7065233Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:02.7065381Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.7065479Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:02.7065574Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.7065705Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.7066730Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator 
>, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.7067927Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.7068779Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.7070218Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:11:02.7071811Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:11:02.7072668Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.7073553Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:11:02.7074338Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:11:02.7074947Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:11:02.7075840Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:11:02.7076849Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:11:02.7077485Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:11:02.7078264Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:11:02.7079011Z U typeinfo for c10::Error 2025-05-07T20:11:02.7079387Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:02.7079790Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:11:02.7080222Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:02.7080664Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:02.7081201Z U unsigned char* at::TensorBase::mutable_data_ptr() const 
2025-05-07T20:11:02.7081690Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.7082147Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.7082606Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.7082990Z U vtable for c10::Error 2025-05-07T20:11:02.7083579Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7084379Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7085191Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7085907Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.7086462Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.7086964Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:02.7087323Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.7087688Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.7088031Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.7088349Z w __gmon_start__ 2025-05-07T20:11:02.7088667Z w __pthread_key_create 2025-05-07T20:11:02.7088989Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.7089355Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.7089740Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.7090243Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.7090573Z 2025-05-07T20:11:02.7090717Z linux-vdso.so.1 (0x00007ffcc5b85000) 2025-05-07T20:11:02.7091016Z libc10.so => not found 2025-05-07T20:11:02.7091307Z libc10_cuda.so => not found 2025-05-07T20:11:02.7091922Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f7820c00000) 2025-05-07T20:11:02.7092923Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f781f400000) 2025-05-07T20:11:02.7093602Z libtorch.so => 
not found 2025-05-07T20:11:02.7093892Z libtorch_cpu.so => not found 2025-05-07T20:11:02.7094174Z libtorch_cuda.so => not found 2025-05-07T20:11:02.7094471Z libcudart.so.12 => not found 2025-05-07T20:11:02.7094842Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f781f19c000) 2025-05-07T20:11:02.7095273Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f783c6c2000) 2025-05-07T20:11:02.7095696Z libc.so.6 => /lib64/libc.so.6 (0x00007f781ef94000) 2025-05-07T20:11:02.7096073Z /lib64/ld-linux-x86-64.so.2 (0x00007f783c6f6000) 2025-05-07T20:11:02.7096430Z libc10.so => not found 2025-05-07T20:11:02.7096946Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f783c645000) 2025-05-07T20:11:02.7097541Z libtorch.so => not found 2025-05-07T20:11:02.7097831Z libtorch_cpu.so => not found 2025-05-07T20:11:02.7098117Z libtorch_cuda.so => not found 2025-05-07T20:11:02.7098444Z libm.so.6 => /lib64/libm.so.6 (0x00007f7820b25000) 2025-05-07T20:11:02.7098784Z libtorch.so => not found 2025-05-07T20:11:02.7099072Z libc10.so => not found 2025-05-07T20:11:02.7099363Z libc10_cuda.so => not found 2025-05-07T20:11:02.7099669Z libtorch_cpu.so => not found 2025-05-07T20:11:02.7099951Z libtorch_cuda.so => not found 2025-05-07T20:11:02.7100261Z libcudart.so.12 => not found 2025-05-07T20:11:02.7100570Z libtorch_cpu.so => not found 2025-05-07T20:11:02.7100880Z libtorch_cuda.so => not found 2025-05-07T20:11:02.7101190Z libtorch.so => not found 2025-05-07T20:11:02.7101359Z 2025-05-07T20:11:02.7101478Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.7101951Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:02.7102307Z 2025-05-07T20:11:02.7102337Z 2025-05-07T20:11:02.7102508Z Dynamic section at offset 0x1af13978 contains 40 entries: 2025-05-07T20:11:02.7102934Z Tag Type Name/Value 2025-05-07T20:11:02.7103394Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.7103912Z 
0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:02.7104462Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:02.7105003Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:02.7105690Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.7106310Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.7107043Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.7107761Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:02.7108342Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.7108885Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.7109395Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.7109966Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.7110568Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:11:02.7111107Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.7111544Z 0x000000000000000c (INIT) 0x19a000 2025-05-07T20:11:02.7111899Z 0x000000000000000d (FINI) 0x7e3f4c 2025-05-07T20:11:02.7112277Z 0x0000000000000019 (INIT_ARRAY) 0x1af13d58 2025-05-07T20:11:02.7112648Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:11:02.7113037Z 0x000000000000001a (FINI_ARRAY) 0x1af13ee0 2025-05-07T20:11:02.7113428Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.7113798Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:02.7114163Z 0x000000006ffffef5 (GNU_HASH) 0x7048 2025-05-07T20:11:02.7114508Z 0x0000000000000005 (STRTAB) 0x2bee8 2025-05-07T20:11:02.7114871Z 0x0000000000000006 (SYMTAB) 0xea58 2025-05-07T20:11:02.7115240Z 0x000000000000000a (STRSZ) 1363139 (bytes) 2025-05-07T20:11:02.7115647Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.7116079Z 
0x0000000000000003 (PLTGOT) 0x1af14c38 2025-05-07T20:11:02.7116496Z 0x0000000000000002 (PLTRELSZ) 15648 (bytes) 2025-05-07T20:11:02.7116885Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.7117224Z 0x0000000000000017 (JMPREL) 0x195ff8 2025-05-07T20:11:02.7117596Z 0x0000000000000007 (RELA) 0x17b418 2025-05-07T20:11:02.7117960Z 0x0000000000000008 (RELASZ) 109536 (bytes) 2025-05-07T20:11:02.7118362Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.7118701Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.7119061Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.7119426Z 0x000000006ffffffe (VERNEED) 0x17b2b8 2025-05-07T20:11:02.7119798Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:02.7120194Z 0x000000006ffffff0 (VERSYM) 0x178bac 2025-05-07T20:11:02.7120540Z 0x000000006ffffff9 (RELACOUNT) 79 2025-05-07T20:11:02.7120887Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.7121128Z 2025-05-07T20:11:02.7121253Z ################################################################################ 2025-05-07T20:11:02.7121506Z 2025-05-07T20:11:02.7121510Z 2025-05-07T20:11:02.7121631Z ################################################################################ 2025-05-07T20:11:02.7122231Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.7122810Z [CHECK] Listing out library size: 2025-05-07T20:11:02.7123373Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.7123827Z 2025-05-07T20:11:02.7124115Z 4 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.7124523Z 2025-05-07T20:11:02.7124995Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.7126164Z + objdump -TC 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.7126867Z 2025-05-07T20:11:02.7338331Z GLIBC_2.2.5 2025-05-07T20:11:02.7339126Z GLIBC_2.3 2025-05-07T20:11:02.7339445Z GLIBC_2.14 2025-05-07T20:11:02.7340562Z 2025-05-07T20:11:02.7340598Z 2025-05-07T20:11:02.7341217Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.7342424Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.7343117Z 2025-05-07T20:11:02.7557034Z GLIBCXX_3.4 2025-05-07T20:11:02.7557727Z GLIBCXX_3.4.9 2025-05-07T20:11:02.7557990Z GLIBCXX_3.4.11 2025-05-07T20:11:02.7558229Z GLIBCXX_3.4.15 2025-05-07T20:11:02.7558475Z GLIBCXX_3.4.18 2025-05-07T20:11:02.7558700Z GLIBCXX_3.4.20 2025-05-07T20:11:02.7558940Z GLIBCXX_3.4.21 2025-05-07T20:11:02.7559176Z GLIBCXX_3.4.29 2025-05-07T20:11:02.7559315Z 2025-05-07T20:11:02.7559320Z 2025-05-07T20:11:02.7581468Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.HZpaAtcqlf.symbols.txt 2025-05-07T20:11:02.7583468Z 2025-05-07T20:11:02.7760278Z 2025-05-07T20:11:02.7788195Z [CHECK] Total Number of symbols: 2654 2025-05-07T20:11:02.7809326Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:11:02.7825692Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.oKBnzmNEmR.usymbols.txt 2025-05-07T20:11:02.7826444Z 2025-05-07T20:11:02.7848973Z 2025-05-07T20:11:02.7874352Z [CHECK] Listing out undefined symbols (194 total): 2025-05-07T20:11:02.7888643Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7890341Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:11:02.7891331Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:02.7892296Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:02.7893236Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:02.7894178Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:02.7895073Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:02.7895455Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:02.7895799Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:02.7896173Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:02.7896514Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:02.7896870Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:02.7897368Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:02.7897741Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:02.7898158Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:02.7898571Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:02.7899036Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:02.7899390Z U at::RecordFunction::end() 2025-05-07T20:11:02.7899810Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:02.7900224Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:02.7901259Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7902448Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:02.7903599Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7905067Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:02.7905974Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:02.7906476Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:02.7907006Z U 
at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:02.7907464Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:02.7907915Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:02.7908322Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:02.7908742Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:02.7909165Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:02.7909516Z U c10::AnyType::get() 2025-05-07T20:11:02.7909849Z U c10::BoolType::get() 2025-05-07T20:11:02.7910239Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:02.7910724Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:02.7911491Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:02.7912734Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:02.7913871Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.7914495Z U c10::Error::what() const 2025-05-07T20:11:02.7914820Z U c10::FloatType::get() 2025-05-07T20:11:02.7915167Z U c10::GradMode::is_enabled() 2025-05-07T20:11:02.7915500Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:02.7915901Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:02.7916429Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:02.7916982Z U c10::IValue::isBoolList() const 2025-05-07T20:11:02.7917435Z U c10::IValue::isIntList() const 2025-05-07T20:11:02.7917786Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:02.7918202Z U c10::IValue::isTensorList() const 2025-05-07T20:11:02.7918580Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:02.7918996Z U c10::IntType::get() 2025-05-07T20:11:02.7919683Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > 
const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7920481Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:02.7920927Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:02.7921303Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:02.7921701Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:02.7922173Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7922830Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:02.7923359Z U c10::StringType::get() 2025-05-07T20:11:02.7923718Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:02.7924157Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:02.7924597Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:02.7925102Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:02.7925553Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:02.7926227Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:02.7926904Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:02.7927282Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:02.7927688Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:02.7928102Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:02.7928484Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:02.7928982Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:02.7929333Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:02.7929710Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:02.7930067Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:02.7930422Z U c10::SymIntType::get() 2025-05-07T20:11:02.7930765Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:02.7931090Z U c10::TensorType::get() 2025-05-07T20:11:02.7931427Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:02.7932047Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:02.7933233Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:02.7934120Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:02.7935164Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.7936137Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:02.7937210Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:02.7938260Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:02.7938911Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:02.7939398Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:02.7939797Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:02.7940474Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:02.7941102Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:02.7941534Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:02.7941999Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:02.7942420Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:02.7942904Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:02.7943348Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:02.7943874Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:02.7944385Z U free@GLIBC_2.2.5 2025-05-07T20:11:02.7944765Z U long 
c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:02.7945226Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:02.7945532Z U memcpy@GLIBC_2.14 2025-05-07T20:11:02.7945862Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:02.7946171Z U memset@GLIBC_2.2.5 2025-05-07T20:11:02.7946827Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:02.7947236Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:02.7947640Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:02.7948088Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:02.7948773Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:02.7949666Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.7950544Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:02.7951333Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:02.7952257Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:02.7953147Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:02.7953759Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:02.7954128Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:02.7954506Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.7954927Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.7955362Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:02.7955810Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:02.7956316Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:02.7956886Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:02.7957618Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 
2025-05-07T20:11:02.7958737Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7959987Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:02.7960772Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:02.7961169Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:02.7961524Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:02.7961901Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.7962254Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:02.7962627Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:02.7962977Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:02.7963415Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7964170Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:02.7964661Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:02.7965093Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:02.7965561Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:02.7966262Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:02.7966981Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:02.7967357Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:02.7967694Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:02.7967991Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:02.7968455Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:02.7969236Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:02.7970317Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.7971110Z U torch::Library::_impl(char const*, 
torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:02.7971628Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:02.7972118Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:02.7972692Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:02.7973164Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:02.7973665Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:02.7974481Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:02.7975087Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:02.7975570Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:02.7976233Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:02.7976675Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:02.7977058Z U torch::autograd::Node::metadata() 2025-05-07T20:11:02.7977430Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:02.7977952Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:02.7978620Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:02.7979219Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:02.7979713Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:02.7980265Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:02.7983296Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:02.7986232Z U 
torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:02.7986720Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:02.7987164Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:02.7988482Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:02.7989676Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:02.7990350Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:02.7991434Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:02.7992045Z U typeinfo for c10::Error 2025-05-07T20:11:02.7992406Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:02.7992819Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:02.7993222Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:02.7993630Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:02.7994017Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:02.7994402Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:02.7994864Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:02.7995292Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:02.7995750Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:02.7996232Z U vtable for c10::Error 2025-05-07T20:11:02.7996798Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7997708Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:02.7998290Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:02.7998771Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:02.7999354Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:02.7999860Z U vtable for 
torch::autograd::Node 2025-05-07T20:11:02.8000300Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:02.8000725Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:02.8001123Z w _ITM_registerTMCloneTable 2025-05-07T20:11:02.8001485Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:02.8001810Z w __gmon_start__ 2025-05-07T20:11:02.8002138Z w __pthread_key_create 2025-05-07T20:11:02.8002467Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:02.8002856Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:02.8003245Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:02.8003832Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.8004247Z 2025-05-07T20:11:02.8004433Z linux-vdso.so.1 (0x00007ffe82703000) 2025-05-07T20:11:02.8004749Z libc10.so => not found 2025-05-07T20:11:02.8005407Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fae5fdce000) 2025-05-07T20:11:02.8006475Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fae5e800000) 2025-05-07T20:11:02.8007164Z libtorch.so => not found 2025-05-07T20:11:02.8007506Z libtorch_cpu.so => not found 2025-05-07T20:11:02.8007799Z libtorch_cuda.so => not found 2025-05-07T20:11:02.8008192Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fae5e59c000) 2025-05-07T20:11:02.8008645Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fae5f9d2000) 2025-05-07T20:11:02.8009196Z libc.so.6 => /lib64/libc.so.6 (0x00007fae5e394000) 2025-05-07T20:11:02.8009577Z /lib64/ld-linux-x86-64.so.2 (0x00007fae5fdde000) 2025-05-07T20:11:02.8009932Z libc10.so => not found 2025-05-07T20:11:02.8010210Z libtorch_cpu.so => not found 2025-05-07T20:11:02.8010487Z libtorch_cuda.so => not found 2025-05-07T20:11:02.8010786Z libtorch.so => not found 2025-05-07T20:11:02.8011044Z libtorch.so => not found 
2025-05-07T20:11:02.8011321Z libc10.so => not found 2025-05-07T20:11:02.8011574Z libc10_cuda.so => not found 2025-05-07T20:11:02.8011871Z libtorch_cpu.so => not found 2025-05-07T20:11:02.8012150Z libtorch_cuda.so => not found 2025-05-07T20:11:02.8012449Z libcudart.so.12 => not found 2025-05-07T20:11:02.8012759Z libm.so.6 => /lib64/libm.so.6 (0x00007fae5e2b9000) 2025-05-07T20:11:02.8013017Z 2025-05-07T20:11:02.8013137Z [CHECK] Displaying ELF information: 2025-05-07T20:11:02.8013675Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:02.8014124Z 2025-05-07T20:11:02.8014129Z 2025-05-07T20:11:02.8014294Z Dynamic section at offset 0x39abb0 contains 38 entries: 2025-05-07T20:11:02.8014703Z Tag Type Name/Value 2025-05-07T20:11:02.8015234Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:02.8015758Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:02.8016311Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:02.8016814Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:02.8017319Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:02.8017817Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:02.8018341Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:02.8018830Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:02.8019337Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:02.8019852Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:02.8020472Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:11:02.8021072Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:02.8021464Z 0x000000000000000c (INIT) 0xb9000 2025-05-07T20:11:02.8021844Z 
0x000000000000000d (FINI) 0x33effc 2025-05-07T20:11:02.8022179Z 0x0000000000000019 (INIT_ARRAY) 0x397b28 2025-05-07T20:11:02.8022553Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:11:02.8022922Z 0x000000000000001a (FINI_ARRAY) 0x397c58 2025-05-07T20:11:02.8023258Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:02.8023607Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:02.8023927Z 0x000000006ffffef5 (GNU_HASH) 0x3b08 2025-05-07T20:11:02.8024281Z 0x0000000000000005 (STRTAB) 0x17258 2025-05-07T20:11:02.8024603Z 0x0000000000000006 (SYMTAB) 0x7970 2025-05-07T20:11:02.8024969Z 0x000000000000000a (STRSZ) 529940 (bytes) 2025-05-07T20:11:02.8025321Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:02.8025678Z 0x0000000000000003 (PLTGOT) 0x39ae50 2025-05-07T20:11:02.8026039Z 0x0000000000000002 (PLTRELSZ) 14112 (bytes) 2025-05-07T20:11:02.8026367Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:02.8026695Z 0x0000000000000017 (JMPREL) 0xb52c8 2025-05-07T20:11:02.8027043Z 0x0000000000000007 (RELA) 0x99e60 2025-05-07T20:11:02.8027402Z 0x0000000000000008 (RELASZ) 111720 (bytes) 2025-05-07T20:11:02.8027745Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:02.8028089Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:02.8028418Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:02.8028757Z 0x000000006ffffffe (VERNEED) 0x99d30 2025-05-07T20:11:02.8029095Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:02.8029409Z 0x000000006ffffff0 (VERSYM) 0x9886c 2025-05-07T20:11:02.8029754Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:11:02.8030053Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:02.8030268Z 2025-05-07T20:11:02.8030389Z ################################################################################ 2025-05-07T20:11:02.8030604Z 2025-05-07T20:11:02.8030608Z 2025-05-07T20:11:02.8030743Z ################################################################################ 2025-05-07T20:11:02.8031248Z [CHECK] BUILT LIBRARY: 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:02.8031790Z [CHECK] Listing out library size: 2025-05-07T20:11:02.8032248Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:02.8032643Z 2025-05-07T20:11:02.8033051Z 343 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:02.8033392Z 2025-05-07T20:11:02.8033825Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:02.8034852Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.8035483Z 2025-05-07T20:11:02.8949080Z GLIBC_2.2.5 2025-05-07T20:11:02.8949741Z GLIBC_2.3 2025-05-07T20:11:02.8950362Z GLIBC_2.14 2025-05-07T20:11:02.8952170Z 2025-05-07T20:11:02.8952183Z 2025-05-07T20:11:02.8953542Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:02.8957097Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:02.8957772Z 2025-05-07T20:11:02.9897988Z GLIBCXX_3.4 2025-05-07T20:11:02.9898397Z GLIBCXX_3.4.9 2025-05-07T20:11:02.9898657Z GLIBCXX_3.4.20 2025-05-07T20:11:02.9898914Z GLIBCXX_3.4.21 2025-05-07T20:11:02.9901482Z GLIBCXX_3.4.29 2025-05-07T20:11:02.9901720Z 2025-05-07T20:11:02.9901725Z 2025-05-07T20:11:02.9919332Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.7TFtbEDGMS.symbols.txt 2025-05-07T20:11:02.9920877Z 2025-05-07T20:11:03.0843751Z 2025-05-07T20:11:03.0884493Z [CHECK] Total Number of symbols: 12731 2025-05-07T20:11:03.0932855Z [CHECK] Number of fbgemm symbols: 5268 
2025-05-07T20:11:03.0949817Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.KYpofICzn2.usymbols.txt 2025-05-07T20:11:03.0951392Z 2025-05-07T20:11:03.1002101Z 2025-05-07T20:11:03.1029051Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:03.1042224Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1043999Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:03.1045072Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.1046057Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.1046686Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.1047197Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:03.1047587Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:03.1048175Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:03.1048563Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.1048983Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:03.1049368Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:03.1049709Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:03.1050082Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:03.1050423Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:03.1050797Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:03.1051147Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:03.1051528Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:03.1051874Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:03.1052232Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:03.1052593Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:03.1053046Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:03.1053460Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:03.1054064Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:03.1054600Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:03.1055290Z U at::Tensor 
fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:03.1077721Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:03.1078405Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:03.1079515Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.1080491Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:03.1081028Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.1081541Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:03.1082046Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:03.1082543Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.1083187Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1083662Z U c10::BoolType::get() 2025-05-07T20:11:03.1084087Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:03.1084580Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:03.1085028Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:03.1085786Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:03.1087081Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:03.1088233Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.1088964Z U c10::Error::what() const 2025-05-07T20:11:03.1089283Z U c10::FloatType::get() 2025-05-07T20:11:03.1089634Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.1090277Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1090753Z U 
c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:03.1091141Z U c10::IntType::get() 2025-05-07T20:11:03.1091541Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:03.1092129Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:03.1092525Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.1092893Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.1093326Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:03.1093764Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:03.1094183Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:03.1094881Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:03.1095548Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:03.1095958Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:11:03.1096379Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:03.1096803Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:03.1097200Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:03.1097589Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:11:03.1097993Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:03.1098384Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:03.1098785Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:03.1099181Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:03.1099518Z U c10::SymIntType::get() 2025-05-07T20:11:03.1099871Z U c10::TensorType::get() 2025-05-07T20:11:03.1100217Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:03.1101207Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:03.1102206Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:03.1102594Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:03.1102984Z U c10::cuda::ExchangeDevice(signed char) 
2025-05-07T20:11:03.1103381Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:03.1103771Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:03.1104163Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:03.1104678Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:03.1105197Z U c10::cuda::device_count() 2025-05-07T20:11:03.1105561Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:03.1105984Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:03.1106391Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:03.1106820Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:03.1107265Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:03.1107392Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:03.1107928Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.1108223Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:03.1108729Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.1109119Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:03.1109732Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.1109844Z U c10::get_default_dtype() 2025-05-07T20:11:03.1109970Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:03.1110235Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:03.1110680Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:03.1110861Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:03.1111012Z U c10::impl::device_guard_impl_registry 
2025-05-07T20:11:03.1111135Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.1111290Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:03.1111468Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:11:03.1111593Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:11:03.1111721Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:03.1111863Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:03.1112009Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:03.1112147Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:03.1112311Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:03.1112471Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:03.1112581Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:03.1112771Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:03.1112923Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:03.1113058Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:03.1113178Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:03.1113332Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:03.1113447Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:03.1113567Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:03.1113748Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:03.1113871Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:03.1114015Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:03.1114134Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:03.1114274Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:03.1114409Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:03.1114532Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:03.1114839Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.1114963Z U float at::Tensor::item() const 2025-05-07T20:11:03.1115104Z U 
float* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.1115284Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1115389Z U free@GLIBC_2.2.5 2025-05-07T20:11:03.1115506Z U int at::Tensor::item() const 2025-05-07T20:11:03.1115661Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.1115806Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1115979Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:03.1116286Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.1116617Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1116737Z U memcpy@GLIBC_2.14 2025-05-07T20:11:03.1116845Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:03.1116982Z U memset@GLIBC_2.2.5 2025-05-07T20:11:03.1117230Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:03.1117368Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.1117753Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:03.1118160Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.1118509Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:03.1118664Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:03.1118857Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1119012Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1119219Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:03.1119473Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:03.1119835Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:03.1120448Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1120974Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char 
const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1121141Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:03.1121273Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:03.1121407Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:03.1121538Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.1121691Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.1121846Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:03.1121971Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:03.1122222Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1122474Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1122611Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:03.1122752Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:03.1122863Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:03.1122997Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:03.1123623Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:03.1124100Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.1124372Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.1124771Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.1124943Z U typeinfo for c10::Error 2025-05-07T20:11:03.1125107Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:03.1125306Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:03.1125479Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.1125593Z U vtable for c10::Error 2025-05-07T20:11:03.1125981Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1126323Z U vtable for 
std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1126536Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:03.1126805Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:03.1126996Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.1127120Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:03.1127304Z w _ITM_registerTMCloneTable 2025-05-07T20:11:03.1127422Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:03.1127535Z w __gmon_start__ 2025-05-07T20:11:03.1127660Z w __pthread_key_create 2025-05-07T20:11:03.1127817Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:03.1128107Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:03.1128115Z 2025-05-07T20:11:03.1128252Z linux-vdso.so.1 (0x00007fffc21c6000) 2025-05-07T20:11:03.1128356Z libc10.so => not found 2025-05-07T20:11:03.1128457Z libc10_cuda.so => not found 2025-05-07T20:11:03.1129090Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f6555e00000) 2025-05-07T20:11:03.1129193Z libtorch.so => not found 2025-05-07T20:11:03.1129295Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1129429Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1129533Z libcudart.so.12 => not found 2025-05-07T20:11:03.1129691Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6555b9c000) 2025-05-07T20:11:03.1129845Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f656c171000) 2025-05-07T20:11:03.1130001Z libc.so.6 => /lib64/libc.so.6 (0x00007f6555994000) 2025-05-07T20:11:03.1130134Z /lib64/ld-linux-x86-64.so.2 (0x00007f656c1a5000) 2025-05-07T20:11:03.1130257Z libc10.so => not found 2025-05-07T20:11:03.1130379Z libc10_cuda.so => not found 2025-05-07T20:11:03.1130748Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f6555400000) 2025-05-07T20:11:03.1131171Z 
fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f656c163000) 2025-05-07T20:11:03.1131292Z libtorch.so => not found 2025-05-07T20:11:03.1131394Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1131497Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1131602Z libcudart.so.12 => not found 2025-05-07T20:11:03.1131750Z libm.so.6 => /lib64/libm.so.6 (0x00007f6555325000) 2025-05-07T20:11:03.1131846Z libc10.so => not found 2025-05-07T20:11:03.1132183Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f6556185000) 2025-05-07T20:11:03.1132309Z libtorch.so => not found 2025-05-07T20:11:03.1132409Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1132513Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1132608Z libc10.so => not found 2025-05-07T20:11:03.1132731Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1132834Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1132938Z libtorch.so => not found 2025-05-07T20:11:03.1133058Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1133186Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1133289Z libtorch.so => not found 2025-05-07T20:11:03.1133294Z 2025-05-07T20:11:03.1133404Z [CHECK] Displaying ELF information: 2025-05-07T20:11:03.1133692Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:03.1133697Z 2025-05-07T20:11:03.1152248Z 2025-05-07T20:11:03.1152895Z Dynamic section at offset 0x1569a110 contains 39 entries: 2025-05-07T20:11:03.1153322Z Tag Type Name/Value 2025-05-07T20:11:03.1153536Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:03.1153757Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:03.1154023Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:03.1154240Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 
2025-05-07T20:11:03.1154454Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:03.1154700Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:03.1154912Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:03.1155226Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:03.1155432Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:03.1155657Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:03.1155882Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:03.1156248Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:11:03.1156479Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:03.1156607Z 0x000000000000000c (INIT) 0x44b000 2025-05-07T20:11:03.1156741Z 0x000000000000000d (FINI) 0x22530cc 2025-05-07T20:11:03.1156910Z 0x0000000000000019 (INIT_ARRAY) 0x15698508 2025-05-07T20:11:03.1157089Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:11:03.1157222Z 0x000000000000001a (FINI_ARRAY) 0x156987f8 2025-05-07T20:11:03.1157358Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:03.1157511Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:03.1157642Z 0x000000006ffffef5 (GNU_HASH) 0x10898 2025-05-07T20:11:03.1157772Z 0x0000000000000005 (STRTAB) 0x6f610 2025-05-07T20:11:03.1157923Z 0x0000000000000006 (SYMTAB) 0x24c70 2025-05-07T20:11:03.1158122Z 0x000000000000000a (STRSZ) 3691715 (bytes) 2025-05-07T20:11:03.1158258Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:03.1158458Z 0x0000000000000003 (PLTGOT) 0x1569a3c0 2025-05-07T20:11:03.1158597Z 0x0000000000000002 (PLTRELSZ) 10920 (bytes) 2025-05-07T20:11:03.1158707Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:03.1158834Z 0x0000000000000017 (JMPREL) 0x4484b0 2025-05-07T20:11:03.1158984Z 0x0000000000000007 (RELA) 0x3faf60 
2025-05-07T20:11:03.1159125Z 0x0000000000000008 (RELASZ) 316752 (bytes) 2025-05-07T20:11:03.1159252Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:03.1159373Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:03.1159502Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:03.1159624Z 0x000000006ffffffe (VERNEED) 0x3fae50 2025-05-07T20:11:03.1159756Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:03.1159879Z 0x000000006ffffff0 (VERSYM) 0x3f4ad4 2025-05-07T20:11:03.1159996Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:11:03.1160100Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:03.1160141Z 2025-05-07T20:11:03.1160264Z ################################################################################ 2025-05-07T20:11:03.1160310Z 2025-05-07T20:11:03.1160314Z 2025-05-07T20:11:03.1160434Z ################################################################################ 2025-05-07T20:11:03.1160771Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1160884Z [CHECK] Listing out library size: 2025-05-07T20:11:03.1161188Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1161193Z 2025-05-07T20:11:03.1165616Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1166603Z 2025-05-07T20:11:03.1167505Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1168043Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.1168050Z 2025-05-07T20:11:03.1226176Z GLIBC_2.2.5 2025-05-07T20:11:03.1226595Z GLIBC_2.3 2025-05-07T20:11:03.1227444Z GLIBC_2.14 2025-05-07T20:11:03.1227503Z 2025-05-07T20:11:03.1227517Z 2025-05-07T20:11:03.1229137Z [CHECK] Listing out the 
GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1230819Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.1230836Z 2025-05-07T20:11:03.1283918Z GLIBCXX_3.4 2025-05-07T20:11:03.1284363Z GLIBCXX_3.4.9 2025-05-07T20:11:03.1284504Z GLIBCXX_3.4.18 2025-05-07T20:11:03.1284623Z GLIBCXX_3.4.20 2025-05-07T20:11:03.1284712Z GLIBCXX_3.4.21 2025-05-07T20:11:03.1284837Z GLIBCXX_3.4.29 2025-05-07T20:11:03.1284850Z 2025-05-07T20:11:03.1284861Z 2025-05-07T20:11:03.1306576Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.efXdhCWMen.symbols.txt 2025-05-07T20:11:03.1306711Z 2025-05-07T20:11:03.1331422Z 2025-05-07T20:11:03.1355145Z [CHECK] Total Number of symbols: 356 2025-05-07T20:11:03.1367119Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:03.1386572Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.wca4dcX7Cm.usymbols.txt 2025-05-07T20:11:03.1386596Z 2025-05-07T20:11:03.1399889Z 2025-05-07T20:11:03.1427780Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:11:03.1440832Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1441930Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1442396Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:03.1442840Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.1443266Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.1443645Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.1443974Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:03.1444117Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:03.1444238Z U __cudaRegisterVar@libcudart.so.12 
2025-05-07T20:11:03.1444405Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.1444517Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:03.1444634Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:03.1444766Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:03.1444883Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:03.1445001Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:03.1445117Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:03.1445248Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:03.1445409Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:03.1445525Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:03.1445653Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:03.1446255Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.1447103Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.1447309Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:03.1447465Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:03.1447574Z U c10::IntType::get() 2025-05-07T20:11:03.1447782Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:03.1447914Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:03.1448144Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.1448631Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:03.1448778Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:03.1448889Z U c10::TensorType::get() 2025-05-07T20:11:03.1449045Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:03.1449773Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:03.1449917Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:03.1450070Z U c10::cuda::CUDAStream::stream() 
const 2025-05-07T20:11:03.1450199Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:03.1450323Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:03.1450479Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:03.1450602Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:03.1450854Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:03.1451023Z U c10::cuda::device_count() 2025-05-07T20:11:03.1451170Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:03.1451312Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:03.1451516Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:03.1451666Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:03.1451831Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:03.1451978Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:03.1452500Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.1452877Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:03.1453389Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.1453834Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:03.1453953Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:03.1454089Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:03.1454249Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:03.1454387Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:03.1454537Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:03.1454641Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:03.1454828Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:03.1454974Z U cudaDeviceSynchronize@libcudart.so.12 
2025-05-07T20:11:03.1455106Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:03.1455222Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:03.1455346Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:03.1455481Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:03.1455598Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:03.1455721Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:03.1455870Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:03.1455986Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:03.1456132Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:03.1456262Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:03.1456368Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:03.1456497Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:03.1456617Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:03.1456773Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1456950Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:03.1457102Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1457212Z U memcpy@GLIBC_2.14 2025-05-07T20:11:03.1457315Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:03.1457415Z U memset@GLIBC_2.2.5 2025-05-07T20:11:03.1457568Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:03.1457693Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.1458019Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:03.1458405Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.1458772Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:03.1459158Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:03.1459302Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:03.1459422Z U 
std::__throw_bad_array_new_length() 2025-05-07T20:11:03.1459562Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1459718Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1459888Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:03.1460118Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:03.1460462Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:03.1461004Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1461483Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.1461651Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:03.1461780Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:03.1461899Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.1462035Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.1462156Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:03.1462274Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:03.1462463Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1462712Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.1462843Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:03.1462963Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:03.1463063Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:03.1463183Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:03.1463784Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:03.1464221Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.1464506Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 
2025-05-07T20:11:03.1464851Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.1465065Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.1465236Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:03.1465395Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:03.1465554Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.1465906Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1466273Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1466601Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.1466838Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:03.1467059Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:03.1467175Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:03.1467320Z w _ITM_registerTMCloneTable 2025-05-07T20:11:03.1467428Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:03.1467532Z w __gmon_start__ 2025-05-07T20:11:03.1467662Z w __pthread_key_create 2025-05-07T20:11:03.1467811Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:03.1468048Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1468055Z 2025-05-07T20:11:03.1483816Z linux-vdso.so.1 (0x00007fffda502000) 2025-05-07T20:11:03.1484048Z libtorch.so => not found 2025-05-07T20:11:03.1484208Z libc10.so => not found 2025-05-07T20:11:03.1484325Z libc10_cuda.so => not found 2025-05-07T20:11:03.1484473Z libtorch_cpu.so => not found 2025-05-07T20:11:03.1484576Z libtorch_cuda.so => not found 2025-05-07T20:11:03.1484686Z libcudart.so.12 => not found 2025-05-07T20:11:03.1485012Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa79c683000) 2025-05-07T20:11:03.1485177Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 
(0x00007fa79c655000) 2025-05-07T20:11:03.1485319Z libc.so.6 => /lib64/libc.so.6 (0x00007fa79c44d000) 2025-05-07T20:11:03.1485478Z /lib64/ld-linux-x86-64.so.2 (0x00007fa79c95a000) 2025-05-07T20:11:03.1485605Z libm.so.6 => /lib64/libm.so.6 (0x00007fa79c372000) 2025-05-07T20:11:03.1485618Z 2025-05-07T20:11:03.1485735Z [CHECK] Displaying ELF information: 2025-05-07T20:11:03.1486042Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:03.1486050Z 2025-05-07T20:11:03.1516834Z 2025-05-07T20:11:03.1517613Z Dynamic section at offset 0x6a540 contains 37 entries: 2025-05-07T20:11:03.1518008Z Tag Type Name/Value 2025-05-07T20:11:03.1518694Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:03.1519259Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:03.1519864Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:03.1520484Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:03.1523390Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:03.1523741Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:03.1523962Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:03.1524162Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:03.1524350Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:03.1524588Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:03.1524841Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:03.1524964Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:11:03.1525079Z 0x000000000000000d (FINI) 0x2c63c 2025-05-07T20:11:03.1525221Z 0x0000000000000019 (INIT_ARRAY) 0x6b1f8 2025-05-07T20:11:03.1525350Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:11:03.1525461Z 0x000000000000001a (FINI_ARRAY) 
0x6b220 2025-05-07T20:11:03.1525606Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:03.1525716Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:03.1525835Z 0x000000006ffffef5 (GNU_HASH) 0x12b0 2025-05-07T20:11:03.1526016Z 0x0000000000000005 (STRTAB) 0x3ff0 2025-05-07T20:11:03.1526133Z 0x0000000000000006 (SYMTAB) 0x1e78 2025-05-07T20:11:03.1526269Z 0x000000000000000a (STRSZ) 31425 (bytes) 2025-05-07T20:11:03.1526426Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:03.1526570Z 0x0000000000000003 (PLTGOT) 0x6b7e0 2025-05-07T20:11:03.1526709Z 0x0000000000000002 (PLTRELSZ) 4320 (bytes) 2025-05-07T20:11:03.1526823Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:03.1526952Z 0x0000000000000017 (JMPREL) 0xd0f8 2025-05-07T20:11:03.1527063Z 0x0000000000000007 (RELA) 0xbeb0 2025-05-07T20:11:03.1527195Z 0x0000000000000008 (RELASZ) 4680 (bytes) 2025-05-07T20:11:03.1527344Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:03.1527450Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:03.1527579Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:03.1527702Z 0x000000006ffffffe (VERNEED) 0xbd80 2025-05-07T20:11:03.1528014Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:03.1528316Z 0x000000006ffffff0 (VERSYM) 0xbab2 2025-05-07T20:11:03.1528437Z 0x000000006ffffff9 (RELACOUNT) 24 2025-05-07T20:11:03.1528571Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:03.1528577Z 2025-05-07T20:11:03.1528736Z ################################################################################ 2025-05-07T20:11:03.1528741Z 2025-05-07T20:11:03.1528746Z 2025-05-07T20:11:03.1528871Z ################################################################################ 2025-05-07T20:11:03.1529183Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.1529287Z [CHECK] Listing out library size: 2025-05-07T20:11:03.1529570Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 
2025-05-07T20:11:03.1529575Z 2025-05-07T20:11:03.1529819Z 35 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.1529824Z 2025-05-07T20:11:03.1530228Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.1530730Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.1530737Z 2025-05-07T20:11:03.1638985Z GLIBC_2.2.5 2025-05-07T20:11:03.1639242Z GLIBC_2.3 2025-05-07T20:11:03.1639484Z GLIBC_2.14 2025-05-07T20:11:03.1639802Z 2025-05-07T20:11:03.1639815Z 2025-05-07T20:11:03.1641116Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.1642689Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.1642706Z 2025-05-07T20:11:03.1749486Z GLIBCXX_3.4 2025-05-07T20:11:03.1749773Z GLIBCXX_3.4.9 2025-05-07T20:11:03.1750016Z GLIBCXX_3.4.11 2025-05-07T20:11:03.1750249Z GLIBCXX_3.4.15 2025-05-07T20:11:03.1750497Z GLIBCXX_3.4.18 2025-05-07T20:11:03.1750736Z GLIBCXX_3.4.20 2025-05-07T20:11:03.1750964Z GLIBCXX_3.4.21 2025-05-07T20:11:03.1751213Z GLIBCXX_3.4.29 2025-05-07T20:11:03.1751247Z 2025-05-07T20:11:03.1751260Z 2025-05-07T20:11:03.1773036Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.ASXmGO3ot6.symbols.txt 2025-05-07T20:11:03.1851440Z 2025-05-07T20:11:03.1851485Z 2025-05-07T20:11:03.1879133Z [CHECK] Total Number of symbols: 1477 2025-05-07T20:11:03.1896629Z [CHECK] Number of fbgemm symbols: 213 2025-05-07T20:11:03.1915243Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.F0E5Tv8BLs.usymbols.txt 
2025-05-07T20:11:03.1915291Z 2025-05-07T20:11:03.1939605Z 2025-05-07T20:11:03.1978231Z [CHECK] Listing out undefined symbols (270 total): 2025-05-07T20:11:03.2000698Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.2002278Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.2002588Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:03.2003034Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.2003464Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.2003845Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.2004232Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:03.2004612Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:03.2004951Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:03.2005337Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.2005676Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:03.2006136Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:03.2006249Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:03.2006356Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:03.2006482Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:03.2006643Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:03.2006754Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:03.2006878Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:03.2006983Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:03.2007096Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:03.2007197Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:03.2007321Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:03.2007425Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:03.2007543Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:03.2007707Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:03.2007885Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:03.2008016Z U at::RecordFunction::currentThreadId() 
2025-05-07T20:11:03.2008160Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:03.2008318Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:03.2008507Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:03.2008683Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:03.2008803Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:03.2008955Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:11:03.2009132Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:03.2009727Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.2010386Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.2010580Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.2010758Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.2010950Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:03.2011137Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.2011479Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.2011687Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:03.2011851Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:03.2012027Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.2012230Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:11:03.2012419Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:03.2012664Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:03.2012968Z U 
at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:03.2013611Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:03.2013792Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.2013971Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.2014478Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.2015172Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.2015316Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:03.2015443Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:03.2015602Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:03.2015730Z U at::globalContext() 2025-05-07T20:11:03.2015882Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:11:03.2016010Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:03.2016143Z U bool at::Tensor::item() const 2025-05-07T20:11:03.2016238Z U c10::AnyType::get() 2025-05-07T20:11:03.2016411Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:03.2016621Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.2016779Z U c10::BoolType::get() 2025-05-07T20:11:03.2016946Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:03.2017248Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:03.2017382Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:03.2017865Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:03.2018461Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, 
c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:03.2018823Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.2018932Z U c10::Error::what() const 2025-05-07T20:11:03.2019051Z U c10::GradMode::is_enabled() 2025-05-07T20:11:03.2019165Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:03.2019340Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.2019713Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:03.2019856Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:03.2019969Z U c10::IValue::isBoolList() const 2025-05-07T20:11:03.2020103Z U c10::IValue::isIntList() const 2025-05-07T20:11:03.2020227Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:03.2020345Z U c10::IValue::isTensorList() const 2025-05-07T20:11:03.2020483Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:03.2020604Z U c10::IntType::get() 2025-05-07T20:11:03.2021063Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.2021227Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:03.2021372Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:03.2021504Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.2021619Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.2021905Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:03.2022055Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.2022177Z U c10::StringType::get() 2025-05-07T20:11:03.2022338Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:03.2022721Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:03.2022857Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:03.2023181Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:03.2023298Z U c10::SymInt::toSymNode() const 
2025-05-07T20:11:03.2023404Z U c10::SymIntType::get() 2025-05-07T20:11:03.2023578Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:03.2023702Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:03.2024129Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:03.2024306Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.2024418Z U c10::TensorType::get() 2025-05-07T20:11:03.2024634Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:03.2024747Z U c10::Type::is_module() const 2025-05-07T20:11:03.2024902Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:03.2025599Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:03.2025764Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:03.2025890Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:03.2026012Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:03.2026133Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:03.2026278Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:03.2026387Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:03.2026634Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:03.2026752Z U c10::cuda::device_count() 2025-05-07T20:11:03.2026886Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:03.2027019Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:03.2028283Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:03.2028429Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:03.2028621Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:03.2028754Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:03.2029175Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.2029679Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.2029950Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:03.2030430Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.2030773Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:03.2031341Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.2031639Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:03.2031844Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:03.2031966Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:03.2032075Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:03.2032396Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:03.2032580Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:03.2032726Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:03.2033081Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:03.2033203Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:03.2033320Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.2033492Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:03.2033869Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.2034046Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:03.2034198Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:03.2034364Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:03.2034482Z U 
c10::throwNullDataPtrError() 2025-05-07T20:11:03.2034603Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:03.2034723Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:03.2034837Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:03.2035031Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:03.2035169Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:03.2035299Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:03.2035425Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:03.2035578Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:03.2035694Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:03.2035821Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:03.2035951Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:03.2036238Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:03.2036372Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:03.2036499Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:03.2036674Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:03.2036800Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:03.2036921Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:03.2037052Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:03.2037171Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:03.2037375Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:03.2037513Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:03.2037712Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:03.2037871Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.2037968Z U free@GLIBC_2.2.5 2025-05-07T20:11:03.2038125Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.2038218Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:03.2038334Z U long at::Tensor::item() const 2025-05-07T20:11:03.2038527Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:03.2038688Z U long* 
at::TensorBase::data_ptr() const 2025-05-07T20:11:03.2038837Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.2038951Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:03.2039047Z U memcpy@GLIBC_2.14 2025-05-07T20:11:03.2039165Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:03.2039284Z U memset@GLIBC_2.2.5 2025-05-07T20:11:03.2039433Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:03.2039562Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.2039682Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:03.2039903Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:03.2040251Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:03.2040669Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.2041018Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:03.2041414Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:03.2041560Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:03.2041685Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:03.2041829Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.2041985Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.2042161Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:03.2042309Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:03.2042474Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:03.2042722Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:03.2043079Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:03.2043704Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.2044239Z U 
std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.2044378Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:03.2044520Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:03.2044652Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:03.2044777Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.2044926Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.2045047Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:03.2045167Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:03.2045382Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.2045625Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.2045758Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:03.2045959Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:03.2046102Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:03.2046789Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:03.2046961Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:03.2047080Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:03.2047187Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:03.2047307Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:03.2047441Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:03.2048045Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:03.2048535Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.2048807Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.2048932Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:03.2049347Z U 
torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:03.2049542Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:03.2049751Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:03.2049963Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:03.2050321Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:03.2050477Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:03.2050689Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:03.2050870Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:03.2050999Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:03.2051142Z U torch::autograd::Node::metadata() 2025-05-07T20:11:03.2051288Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:03.2051534Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:03.2051833Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:03.2052012Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:03.2052230Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:03.2052510Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:03.2055316Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:03.2055505Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:03.2055665Z U torch::autograd::get_current_graph_task_exec_info() 
2025-05-07T20:11:03.2055878Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:03.2056049Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:03.2056467Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:03.2056850Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.2057403Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:03.2057518Z U typeinfo for c10::Error 2025-05-07T20:11:03.2057666Z U typeinfo for c10::Type 2025-05-07T20:11:03.2057833Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.2057974Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:03.2058135Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:03.2058251Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:03.2058492Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:03.2058667Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:03.2058850Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:03.2059014Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.2059182Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.2059300Z U vtable for c10::Error 2025-05-07T20:11:03.2059410Z U vtable for c10::ListType 2025-05-07T20:11:03.2059768Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.2060122Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.2060466Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.2060604Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.2060830Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:03.2061097Z U 
vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:03.2061235Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:03.2061378Z U vtable for torch::autograd::Node 2025-05-07T20:11:03.2061592Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.2061708Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:03.2061836Z w _ITM_registerTMCloneTable 2025-05-07T20:11:03.2061946Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:03.2062052Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:03.2062170Z w __gmon_start__ 2025-05-07T20:11:03.2062279Z w __pthread_key_create 2025-05-07T20:11:03.2062398Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:03.2062515Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:03.2062716Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:03.2062949Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.2062956Z 2025-05-07T20:11:03.2063085Z linux-vdso.so.1 (0x00007ffcbcddd000) 2025-05-07T20:11:03.2063182Z libc10.so => not found 2025-05-07T20:11:03.2063296Z libc10_cuda.so => not found 2025-05-07T20:11:03.2063860Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f5331c50000) 2025-05-07T20:11:03.2064378Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f5330a00000) 2025-05-07T20:11:03.2064491Z libtorch.so => not found 2025-05-07T20:11:03.2064602Z libtorch_cpu.so => not found 2025-05-07T20:11:03.2064730Z libtorch_cuda.so => not found 2025-05-07T20:11:03.2064839Z libcudart.so.12 => not found 2025-05-07T20:11:03.2065012Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f533079c000) 2025-05-07T20:11:03.2065164Z libm.so.6 => /lib64/libm.so.6 (0x00007f5331b75000) 2025-05-07T20:11:03.2065316Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f53341a6000) 
2025-05-07T20:11:03.2065444Z libc.so.6 => /lib64/libc.so.6 (0x00007f5330594000) 2025-05-07T20:11:03.2065581Z /lib64/ld-linux-x86-64.so.2 (0x00007f53341da000) 2025-05-07T20:11:03.2065691Z libc10.so => not found 2025-05-07T20:11:03.2065790Z libc10_cuda.so => not found 2025-05-07T20:11:03.2065888Z libtorch.so => not found 2025-05-07T20:11:03.2066005Z libtorch_cpu.so => not found 2025-05-07T20:11:03.2066105Z libtorch_cuda.so => not found 2025-05-07T20:11:03.2066226Z libcudart.so.12 => not found 2025-05-07T20:11:03.2066330Z libtorch.so => not found 2025-05-07T20:11:03.2066439Z libc10.so => not found 2025-05-07T20:11:03.2066544Z libc10_cuda.so => not found 2025-05-07T20:11:03.2066647Z libtorch_cpu.so => not found 2025-05-07T20:11:03.2066769Z libtorch_cuda.so => not found 2025-05-07T20:11:03.2066873Z libcudart.so.12 => not found 2025-05-07T20:11:03.2066878Z 2025-05-07T20:11:03.2066995Z [CHECK] Displaying ELF information: 2025-05-07T20:11:03.2067271Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:03.2067276Z 2025-05-07T20:11:03.2098863Z 2025-05-07T20:11:03.2099189Z Dynamic section at offset 0x2201930 contains 41 entries: 2025-05-07T20:11:03.2099361Z Tag Type Name/Value 2025-05-07T20:11:03.2099633Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:03.2099897Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:03.2100228Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:03.2100458Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:03.2100671Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:03.2100876Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:03.2101208Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:03.2101434Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 
2025-05-07T20:11:03.2101678Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:03.2101881Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:03.2102098Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:03.2102292Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:03.2102512Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:03.2102810Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:03.2103000Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:03.2103125Z 0x000000000000000c (INIT) 0x51000 2025-05-07T20:11:03.2103267Z 0x000000000000000d (FINI) 0x14a27c 2025-05-07T20:11:03.2103395Z 0x0000000000000019 (INIT_ARRAY) 0x2201bc8 2025-05-07T20:11:03.2103527Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:11:03.2103674Z 0x000000000000001a (FINI_ARRAY) 0x2201c58 2025-05-07T20:11:03.2103803Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:03.2103947Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:03.2104070Z 0x000000006ffffef5 (GNU_HASH) 0x2900 2025-05-07T20:11:03.2104202Z 0x0000000000000005 (STRTAB) 0xda20 2025-05-07T20:11:03.2104321Z 0x0000000000000006 (SYMTAB) 0x4f90 2025-05-07T20:11:03.2104466Z 0x000000000000000a (STRSZ) 224745 (bytes) 2025-05-07T20:11:03.2104613Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:03.2104737Z 0x0000000000000003 (PLTGOT) 0x2202c00 2025-05-07T20:11:03.2104878Z 0x0000000000000002 (PLTRELSZ) 11784 (bytes) 2025-05-07T20:11:03.2104998Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:03.2105138Z 0x0000000000000017 (JMPREL) 0x4da20 2025-05-07T20:11:03.2105253Z 0x0000000000000007 (RELA) 0x45518 2025-05-07T20:11:03.2105397Z 0x0000000000000008 (RELASZ) 34056 (bytes) 2025-05-07T20:11:03.2105549Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:03.2105661Z 0x0000000000000018 (BIND_NOW) 
2025-05-07T20:11:03.2105791Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:03.2105935Z 0x000000006ffffffe (VERNEED) 0x45398 2025-05-07T20:11:03.2106082Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:03.2106204Z 0x000000006ffffff0 (VERSYM) 0x4480a 2025-05-07T20:11:03.2106320Z 0x000000006ffffff9 (RELACOUNT) 388 2025-05-07T20:11:03.2106450Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:03.2106455Z 2025-05-07T20:11:03.2106580Z ################################################################################ 2025-05-07T20:11:03.2106585Z 2025-05-07T20:11:03.2106590Z 2025-05-07T20:11:03.2106703Z ################################################################################ 2025-05-07T20:11:03.2106962Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.2107073Z [CHECK] Listing out library size: 2025-05-07T20:11:03.2107310Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.2107314Z 2025-05-07T20:11:03.2117767Z 74 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.2117779Z 2025-05-07T20:11:03.2118180Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.2118631Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.2118652Z 2025-05-07T20:11:03.2508953Z GLIBC_2.2.5 2025-05-07T20:11:03.2510021Z GLIBC_2.3 2025-05-07T20:11:03.2510793Z GLIBC_2.14 2025-05-07T20:11:03.2513386Z 2025-05-07T20:11:03.2513400Z 2025-05-07T20:11:03.2514691Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.2516336Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 
2025-05-07T20:11:03.2516923Z 2025-05-07T20:11:03.2885134Z GLIBCXX_3.4 2025-05-07T20:11:03.2885926Z GLIBCXX_3.4.9 2025-05-07T20:11:03.2886289Z GLIBCXX_3.4.11 2025-05-07T20:11:03.2886621Z GLIBCXX_3.4.14 2025-05-07T20:11:03.2886835Z GLIBCXX_3.4.15 2025-05-07T20:11:03.2887068Z GLIBCXX_3.4.18 2025-05-07T20:11:03.2887279Z GLIBCXX_3.4.19 2025-05-07T20:11:03.2887509Z GLIBCXX_3.4.20 2025-05-07T20:11:03.2887711Z GLIBCXX_3.4.21 2025-05-07T20:11:03.2887933Z GLIBCXX_3.4.29 2025-05-07T20:11:03.2890177Z 2025-05-07T20:11:03.2890184Z 2025-05-07T20:11:03.2910877Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.jH67xyeiZf.symbols.txt 2025-05-07T20:11:03.2912190Z 2025-05-07T20:11:03.3224609Z 2025-05-07T20:11:03.3249049Z [CHECK] Total Number of symbols: 6350 2025-05-07T20:11:03.3274307Z [CHECK] Number of fbgemm symbols: 4411 2025-05-07T20:11:03.3289725Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.AzBYoY5tgm.usymbols.txt 2025-05-07T20:11:03.3291436Z 2025-05-07T20:11:03.3323882Z 2025-05-07T20:11:03.3349559Z [CHECK] Listing out undefined symbols (483 total): 2025-05-07T20:11:03.3364650Z U GOMP_parallel 2025-05-07T20:11:03.3365833Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.3366681Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.3367371Z U VTT for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:03.3367946Z U VTT for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:03.3368426Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:03.3368762Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:11:03.3369158Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.3369585Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.3370028Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.3370596Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:03.3371005Z U __cudaRegisterFunction@libcudart.so.12 
2025-05-07T20:11:03.3371378Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:03.3371787Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.3372190Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:03.3372542Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:03.3372903Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:03.3373238Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:03.3373616Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:03.3373966Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:03.3374333Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:03.3374709Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:03.3375041Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:03.3375406Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:03.3375746Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:03.3376099Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:03.3376448Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:03.3376923Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:03.3377373Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:03.3377831Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:03.3378323Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:03.3378897Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:03.3379313Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:11:03.3379681Z U at::SplitUntil32Bit::end() const 2025-05-07T20:11:03.3380335Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:11:03.3380741Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:11:03.3381246Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:03.3381802Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:03.3382285Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:11:03.3382767Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:11:03.3383187Z U at::TensorIteratorBase::data_ptr(long) const 
2025-05-07T20:11:03.3383617Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:11:03.3384024Z U at::TensorIteratorBase::numel() const 2025-05-07T20:11:03.3384575Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:11:03.3385063Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:11:03.3385593Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:11:03.3386052Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:03.3386429Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:11:03.3386816Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:11:03.3387329Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3387874Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3388341Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:03.3388916Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:11:03.3389555Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.3390085Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:11:03.3390550Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:11:03.3391051Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3391555Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:03.3392042Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:03.3392509Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:03.3392983Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:11:03.3393659Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:11:03.3394284Z U 
at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3395120Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3396710Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3397724Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:03.3398247Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:03.3398721Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:11:03.3399508Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3400363Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3400980Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:11:03.3401646Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:03.3402136Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:11:03.3402556Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.3403107Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:03.3403504Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.3404360Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3405167Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.3405903Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3406664Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.3407360Z U at::_ops::resize_::call(at::Tensor const&, 
c10::ArrayRef, std::optional) 2025-05-07T20:11:03.3407920Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:03.3408628Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.3409729Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:03.3410336Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:11:03.3410860Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:03.3411377Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:11:03.3411875Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:03.3412634Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:11:03.3413222Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:03.3413942Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:03.3415019Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:03.3415933Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:11:03.3416541Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:11:03.3417073Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:03.3417550Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.3418038Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.3418433Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:03.3419164Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3420352Z U at::_ops::zeros_like::call(at::Tensor 
const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3421316Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:11:03.3421861Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:11:03.3422251Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:03.3422637Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:03.3423069Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:11:03.3423712Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:11:03.3424317Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:03.3424735Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:03.3425107Z U at::get_num_threads() 2025-05-07T20:11:03.3425447Z U at::get_thread_num() 2025-05-07T20:11:03.3425754Z U at::in_parallel_region() 2025-05-07T20:11:03.3426224Z U at::init_num_threads() 2025-05-07T20:11:03.3426762Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:11:03.3427204Z U at::internal::set_thread_num(int) 2025-05-07T20:11:03.3427669Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:11:03.3428543Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3429784Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.3430803Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:03.3431303Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:11:03.3431692Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:03.3432086Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:03.3432489Z U bool at::Tensor::item() const 2025-05-07T20:11:03.3432862Z U bool* 
at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3433247Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3433615Z U c10::AnyType::get() 2025-05-07T20:11:03.3433961Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:03.3434402Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3434896Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3435294Z U c10::BoolType::get() 2025-05-07T20:11:03.3435603Z U c10::DeviceObjType::get() 2025-05-07T20:11:03.3435982Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:03.3436716Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:03.3437173Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:03.3438038Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:03.3439320Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:03.3440439Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.3441061Z U c10::Error::what() const 2025-05-07T20:11:03.3441397Z U c10::FloatType::get() 2025-05-07T20:11:03.3441707Z U c10::GradMode::is_enabled() 2025-05-07T20:11:03.3442056Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:03.3442439Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3442904Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3443415Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:03.3443804Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:03.3444165Z U c10::IValue::isBoolList() const 2025-05-07T20:11:03.3444505Z U c10::IValue::isIntList() const 2025-05-07T20:11:03.3444868Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:03.3445211Z U c10::IValue::isTensorList() const 
2025-05-07T20:11:03.3445605Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:03.3445993Z U c10::InferenceMode::is_enabled() 2025-05-07T20:11:03.3446315Z U c10::IntType::get() 2025-05-07T20:11:03.3447287Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.3448056Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:03.3448489Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:03.3448886Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.3449331Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.3449810Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.3450284Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:03.3450668Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:03.3451024Z U c10::ScalarTypeType::get() 2025-05-07T20:11:03.3451548Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:03.3452279Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:11:03.3452872Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.3453274Z U c10::StringType::get() 2025-05-07T20:11:03.3453642Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:03.3454088Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:03.3454517Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:03.3455194Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:03.3455880Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:03.3456332Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:11:03.3456766Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:03.3457220Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:03.3457594Z U c10::SymInt::promote_to_negative() 
2025-05-07T20:11:03.3457991Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:03.3458381Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:03.3458770Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:03.3459130Z U c10::SymIntType::get() 2025-05-07T20:11:03.3459563Z U c10::SymbolicShapeMeta::init_is_channels_last_3d_contiguous() const 2025-05-07T20:11:03.3460122Z U c10::SymbolicShapeMeta::init_is_channels_last_contiguous() const 2025-05-07T20:11:03.3460610Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:03.3461025Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:03.3461740Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:03.3462462Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.3462954Z U c10::TensorImpl::throw_storage_access_error() const 2025-05-07T20:11:03.3463331Z U c10::TensorType::get() 2025-05-07T20:11:03.3464352Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:03.3465474Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:03.3465903Z U c10::Type::is_module() const 2025-05-07T20:11:03.3466268Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:03.3467212Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:03.3468204Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:03.3468797Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:03.3469433Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:03.3470147Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:03.3470905Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:03.3471425Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:03.3471788Z U 
c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:03.3472141Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:03.3472515Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:03.3472998Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:03.3473505Z U c10::cuda::current_device() 2025-05-07T20:11:03.3473849Z U c10::cuda::device_count() 2025-05-07T20:11:03.3474201Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:03.3474609Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:03.3474996Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:03.3475413Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:03.3475851Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:03.3476394Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:03.3477180Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.3478245Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.3479163Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:03.3480052Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.3481006Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:03.3482068Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.3483063Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:03.3483668Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:03.3484153Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:03.3484516Z U c10::impl::GPUTrace::haveState 
2025-05-07T20:11:03.3485061Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:03.3485699Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:03.3486133Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:11:03.3486521Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:03.3486930Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:03.3487368Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:03.3487780Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:03.3488137Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.3488535Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:03.3489285Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.3489924Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:11:03.3490288Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:03.3490658Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:11:03.3491040Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:03.3491410Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:11:03.3491799Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:11:03.3492167Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:03.3492536Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:03.3492933Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:03.3493336Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:03.3493766Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:03.3494144Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:11:03.3494518Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:03.3494894Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:03.3495235Z U c10::report_overflow(char const*) 
2025-05-07T20:11:03.3495584Z U c10::throwNullDataPtrError() 2025-05-07T20:11:03.3495960Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:03.3496307Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:03.3496657Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:03.3497099Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:03.3497536Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:03.3497875Z U cublasGemmStridedBatchedEx 2025-05-07T20:11:03.3498218Z U cublasSetStream_v2 2025-05-07T20:11:03.3498543Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:03.3498931Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:11:03.3499296Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:03.3499679Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:03.3500050Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:03.3500405Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:03.3500768Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:03.3501099Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:03.3501468Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:03.3501814Z U cudaFree@libcudart.so.12 2025-05-07T20:11:03.3502369Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:03.3502736Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:03.3503073Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:03.3503427Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:03.3503784Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:03.3504158Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:03.3504493Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:03.3504863Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:11:03.3505220Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:11:03.3505674Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:11:03.3506018Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:03.3506343Z U cudaMallocManaged@libcudart.so.12 
2025-05-07T20:11:03.3506675Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:11:03.3507001Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:11:03.3507343Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:11:03.3507681Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:03.3508016Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:03.3508498Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:03.3508977Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:03.3509314Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:03.3509624Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:03.3509961Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:03.3510297Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:03.3510672Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3511081Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3511421Z U exit@GLIBC_2.2.5 2025-05-07T20:11:03.3511694Z U exp10@GLIBC_2.2.5 2025-05-07T20:11:03.3511950Z U exp@GLIBC_2.2.5 2025-05-07T20:11:03.3512220Z U expf@GLIBC_2.2.5 2025-05-07T20:11:03.3512572Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:03.3513052Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:03.3513565Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:03.3514035Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:03.3514543Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:03.3514963Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3515357Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3515691Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:03.3515967Z U fmod@GLIBC_2.2.5 2025-05-07T20:11:03.3516325Z U free@GLIBC_2.2.5 2025-05-07T20:11:03.3516793Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:11:03.3517153Z U int at::Tensor::item() const 
2025-05-07T20:11:03.3517543Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:03.3517966Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3518350Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3518711Z U lgamma@GLIBC_2.2.5 2025-05-07T20:11:03.3519007Z U llrint@GLIBC_2.2.5 2025-05-07T20:11:03.3519289Z U log10@GLIBC_2.2.5 2025-05-07T20:11:03.3519577Z U log2@GLIBC_2.2.5 2025-05-07T20:11:03.3519888Z U log@GLIBC_2.2.5 2025-05-07T20:11:03.3520200Z U long at::Tensor::item() const 2025-05-07T20:11:03.3520592Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:03.3521067Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:03.3521501Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3521894Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3522262Z U lrint@GLIBC_2.2.5 2025-05-07T20:11:03.3522542Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:03.3522832Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:03.3523116Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:03.3523411Z U memcpy@GLIBC_2.14 2025-05-07T20:11:03.3523699Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:03.3524021Z U memset@GLIBC_2.2.5 2025-05-07T20:11:03.3524329Z U nvmlDeviceGetCount_v2 2025-05-07T20:11:03.3524655Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:11:03.3525038Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:11:03.3525412Z U nvmlDeviceGetNvLinkState 2025-05-07T20:11:03.3525750Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:11:03.3526052Z U nvmlInit_v2 2025-05-07T20:11:03.3526331Z U omp_get_num_threads 2025-05-07T20:11:03.3526624Z U omp_get_thread_num 2025-05-07T20:11:03.3526980Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:03.3527385Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.3527747Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.3528106Z U pow@GLIBC_2.2.5 2025-05-07T20:11:03.3528398Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:03.3528882Z U short* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3529327Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3529735Z U sin@GLIBC_2.2.5 2025-05-07T20:11:03.3530127Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:03.3530598Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:03.3531067Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:11:03.3531546Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:03.3532189Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:03.3532844Z U std::__basic_file::~__basic_file()@GLIBCXX_3.4 2025-05-07T20:11:03.3533412Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:03.3534222Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.3535034Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:03.3535810Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.3536648Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:03.3537244Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:03.3537642Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:03.3538018Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:03.3538379Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:03.3538728Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:03.3539056Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:03.3539399Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:03.3539763Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 
2025-05-07T20:11:03.3540134Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.3540524Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.3540904Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.3541337Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:03.3541738Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:03.3542133Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:03.3542621Z U std::basic_filebuf >::basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:03.3543149Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:11:03.3543787Z U std::basic_filebuf >::open(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:03.3544420Z U std::basic_filebuf >::~basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:03.3544985Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:11:03.3545570Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:03.3546231Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:03.3547295Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:11:03.3548286Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.3549497Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.3550296Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:11:03.3550749Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:03.3551126Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:03.3551599Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:03.3551970Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:03.3552376Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:03.3552767Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:03.3553132Z U 
std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.3553522Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.3553879Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:03.3554253Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:03.3554687Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:03.3555210Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.3555776Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.3556382Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:03.3575882Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:03.3576424Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:03.3576840Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:11:03.3577267Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:03.3577708Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:03.3578195Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:03.3579038Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:03.3580090Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:03.3580564Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:03.3580909Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:03.3581234Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:03.3581531Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:03.3581885Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:03.3582711Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:03.3583973Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.3585076Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 
2025-05-07T20:11:03.3585952Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.3586475Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:03.3587030Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:03.3587629Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:03.3588155Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:03.3588662Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:03.3589349Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:03.3589986Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:03.3590481Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:03.3591018Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:03.3591438Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:03.3591809Z U torch::autograd::Node::metadata() 2025-05-07T20:11:03.3592192Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:03.3592915Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:03.3593518Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:03.3594015Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:03.3594467Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:03.3594992Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:03.3598170Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > 
const&, std::function const&) 2025-05-07T20:11:03.3603070Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:03.3603494Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:03.3603954Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:03.3604405Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:03.3605094Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:03.3606011Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.3606926Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.3607656Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:11:03.3608134Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:11:03.3609041Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:03.3609779Z U typeinfo for c10::Error 2025-05-07T20:11:03.3610093Z U typeinfo for c10::Type 2025-05-07T20:11:03.3610434Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.3610808Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:03.3611154Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:03.3611519Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:03.3611865Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:03.3612270Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.3612804Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.3613533Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:03.3614569Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:03.3615592Z U void 
fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:03.3616594Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:03.3617604Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:11:03.3618592Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:11:03.3619607Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:03.3620655Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:03.3621753Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:03.3622861Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:03.3624038Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:03.3624797Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:03.3625192Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:03.3625607Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:03.3625990Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.3626419Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.3626781Z U vtable for at::TensorIterator 2025-05-07T20:11:03.3627142Z U vtable for at::TensorIteratorBase 2025-05-07T20:11:03.3627470Z U vtable for c10::Error 2025-05-07T20:11:03.3627759Z U vtable for c10::ListType 
2025-05-07T20:11:03.3628297Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.3629036Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.3629775Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.3630359Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.3630795Z U vtable for std::basic_filebuf >@GLIBCXX_3.4 2025-05-07T20:11:03.3631328Z U vtable for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:03.3631825Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:03.3632328Z U vtable for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:03.3632839Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:03.3633288Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:03.3633605Z U vtable for torch::autograd::Node 2025-05-07T20:11:03.3633967Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.3634361Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:03.3634657Z w _ITM_registerTMCloneTable 2025-05-07T20:11:03.3634938Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:03.3635233Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:03.3635504Z w __gmon_start__ 2025-05-07T20:11:03.3635761Z w __pthread_key_create 2025-05-07T20:11:03.3636124Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:03.3636609Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:03.3636708Z w pthread_once 2025-05-07T20:11:03.3636864Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:03.3637070Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.3637078Z 2025-05-07T20:11:03.3637279Z linux-vdso.so.1 (0x00007ffcd7f42000) 2025-05-07T20:11:03.3637395Z libc10.so => not found 2025-05-07T20:11:03.3637502Z libc10_cuda.so => not found 2025-05-07T20:11:03.3637873Z fbgemm.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f7a37800000) 2025-05-07T20:11:03.3638040Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:03.3638144Z libtorch.so => not found 2025-05-07T20:11:03.3638699Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f7a3cc0d000) 2025-05-07T20:11:03.3639190Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f7a36600000) 2025-05-07T20:11:03.3639299Z libtorch_cpu.so => not found 2025-05-07T20:11:03.3639399Z libtorch_cuda.so => not found 2025-05-07T20:11:03.3639495Z libcudart.so.12 => not found 2025-05-07T20:11:03.3639681Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7a3639c000) 2025-05-07T20:11:03.3639809Z libm.so.6 => /lib64/libm.so.6 (0x00007f7a37725000) 2025-05-07T20:11:03.3639963Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7a3cbdd000) 2025-05-07T20:11:03.3640117Z libc.so.6 => /lib64/libc.so.6 (0x00007f7a36194000) 2025-05-07T20:11:03.3640253Z /lib64/ld-linux-x86-64.so.2 (0x00007f7a3cdc3000) 2025-05-07T20:11:03.3640348Z libc10.so => not found 2025-05-07T20:11:03.3640725Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f7a37d85000) 2025-05-07T20:11:03.3640857Z libtorch.so => not found 2025-05-07T20:11:03.3640961Z libtorch_cpu.so => not found 2025-05-07T20:11:03.3641062Z libtorch_cuda.so => not found 2025-05-07T20:11:03.3641175Z libc10.so => not found 2025-05-07T20:11:03.3641272Z libc10_cuda.so => not found 2025-05-07T20:11:03.3641370Z libtorch.so => not found 2025-05-07T20:11:03.3641494Z libtorch_cpu.so => not found 2025-05-07T20:11:03.3641597Z libtorch_cuda.so => not found 2025-05-07T20:11:03.3641697Z libcudart.so.12 => not found 2025-05-07T20:11:03.3641794Z libtorch.so => not found 2025-05-07T20:11:03.3641907Z libc10.so => not found 
2025-05-07T20:11:03.3642003Z libc10_cuda.so => not found 2025-05-07T20:11:03.3642105Z libtorch_cpu.so => not found 2025-05-07T20:11:03.3642237Z libtorch_cuda.so => not found 2025-05-07T20:11:03.3642336Z libcudart.so.12 => not found 2025-05-07T20:11:03.3642433Z libtorch_cpu.so => not found 2025-05-07T20:11:03.3642537Z libtorch_cuda.so => not found 2025-05-07T20:11:03.3642649Z libtorch.so => not found 2025-05-07T20:11:03.3642654Z 2025-05-07T20:11:03.3642766Z [CHECK] Displaying ELF information: 2025-05-07T20:11:03.3642973Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:03.3642979Z 2025-05-07T20:11:03.3642983Z 2025-05-07T20:11:03.3643171Z Dynamic section at offset 0x4953578 contains 43 entries: 2025-05-07T20:11:03.3643344Z Tag Type Name/Value 2025-05-07T20:11:03.3643546Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:03.3643799Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:03.3643987Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:03.3644204Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:03.3644428Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:03.3644678Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:03.3644907Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:03.3645113Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:03.3645334Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:03.3645546Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:03.3645754Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:03.3645961Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:03.3646162Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 
2025-05-07T20:11:03.3646382Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:03.3646820Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:03.3647040Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:11:03.3647230Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:03.3647371Z 0x000000000000000c (INIT) 0x18e000 2025-05-07T20:11:03.3647495Z 0x000000000000000d (FINI) 0x7e464c 2025-05-07T20:11:03.3647624Z 0x0000000000000019 (INIT_ARRAY) 0x494d470 2025-05-07T20:11:03.3647765Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:11:03.3647910Z 0x000000000000001a (FINI_ARRAY) 0x494d8f8 2025-05-07T20:11:03.3648039Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:03.3648149Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:03.3648290Z 0x000000006ffffef5 (GNU_HASH) 0x8530 2025-05-07T20:11:03.3648409Z 0x0000000000000005 (STRTAB) 0x363a0 2025-05-07T20:11:03.3648528Z 0x0000000000000006 (SYMTAB) 0x11038 2025-05-07T20:11:03.3648668Z 0x000000000000000a (STRSZ) 1209140 (bytes) 2025-05-07T20:11:03.3648887Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:03.3649012Z 0x0000000000000003 (PLTGOT) 0x4954868 2025-05-07T20:11:03.3649151Z 0x0000000000000002 (PLTRELSZ) 42168 (bytes) 2025-05-07T20:11:03.3649291Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:03.3649409Z 0x0000000000000017 (JMPREL) 0x183378 2025-05-07T20:11:03.3649524Z 0x0000000000000007 (RELA) 0x160a28 2025-05-07T20:11:03.3649688Z 0x0000000000000008 (RELASZ) 141648 (bytes) 2025-05-07T20:11:03.3649808Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:03.3649917Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:03.3650048Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:03.3650195Z 0x000000006ffffffe (VERNEED) 0x160878 2025-05-07T20:11:03.3650306Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:03.3650433Z 0x000000006ffffff0 (VERSYM) 0x15d6d4 2025-05-07T20:11:03.3650561Z 
0x000000006ffffff9 (RELACOUNT) 516 2025-05-07T20:11:03.3650669Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:03.3650674Z 2025-05-07T20:11:03.3650799Z ################################################################################ 2025-05-07T20:11:03.3650804Z 2025-05-07T20:11:03.3650808Z 2025-05-07T20:11:03.3650978Z ################################################################################ 2025-05-07T20:11:03.3651290Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.3651438Z [CHECK] Listing out library size: 2025-05-07T20:11:03.3651767Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.3651775Z 2025-05-07T20:11:03.3652010Z 908 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.3652015Z 2025-05-07T20:11:03.3652439Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.3652990Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.3652996Z 2025-05-07T20:11:03.5326398Z GLIBC_2.2.5 2025-05-07T20:11:03.5326620Z GLIBC_2.3 2025-05-07T20:11:03.5326848Z GLIBC_2.14 2025-05-07T20:11:03.5327104Z 2025-05-07T20:11:03.5327126Z 2025-05-07T20:11:03.5328009Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.5328590Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.5330045Z 2025-05-07T20:11:03.7180023Z GLIBCXX_3.4 2025-05-07T20:11:03.7180695Z GLIBCXX_3.4.9 2025-05-07T20:11:03.7181393Z GLIBCXX_3.4.11 2025-05-07T20:11:03.7182009Z 
GLIBCXX_3.4.14 2025-05-07T20:11:03.7182515Z GLIBCXX_3.4.15 2025-05-07T20:11:03.7182746Z GLIBCXX_3.4.18 2025-05-07T20:11:03.7183002Z GLIBCXX_3.4.20 2025-05-07T20:11:03.7183231Z GLIBCXX_3.4.21 2025-05-07T20:11:03.7183487Z GLIBCXX_3.4.29 2025-05-07T20:11:03.7183741Z 2025-05-07T20:11:03.7183745Z 2025-05-07T20:11:03.7203155Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.wQCBvYe62n.symbols.txt 2025-05-07T20:11:03.7204605Z 2025-05-07T20:11:03.9003362Z 2025-05-07T20:11:03.9075373Z [CHECK] Total Number of symbols: 12349 2025-05-07T20:11:03.9194858Z [CHECK] Number of fbgemm symbols: 2031 2025-05-07T20:11:03.9213022Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.oTXbY4Z3db.usymbols.txt 2025-05-07T20:11:03.9213589Z 2025-05-07T20:11:03.9280536Z 2025-05-07T20:11:03.9312086Z [CHECK] Listing out undefined symbols (289 total): 2025-05-07T20:11:03.9335362Z U GOMP_parallel 2025-05-07T20:11:03.9336636Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.9337475Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.9338063Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:03.9338461Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.9338889Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:03.9339287Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.9339699Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:03.9340091Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:03.9340482Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:03.9340858Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:03.9341255Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:03.9341618Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:03.9341935Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:03.9342278Z U __cxa_end_catch@CXXABI_1.3 
2025-05-07T20:11:03.9342599Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:03.9343030Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:03.9343364Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:03.9343733Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:03.9344122Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:03.9344475Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:03.9344828Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:03.9345155Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:03.9345512Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:03.9345852Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:03.9346289Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:03.9346906Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:03.9347389Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:03.9347826Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:03.9348192Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:03.9348587Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:03.9349028Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:03.9349676Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:03.9350336Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:03.9351204Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.9352554Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.9353523Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:03.9354565Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 
2025-05-07T20:11:03.9355729Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:03.9356460Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:03.9356891Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:03.9357734Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.9358918Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:03.9359780Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:03.9360196Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:03.9360586Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:03.9361007Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:03.9361391Z U at::get_num_threads() 2025-05-07T20:11:03.9361709Z U at::get_thread_num() 2025-05-07T20:11:03.9362018Z U at::globalContext() 2025-05-07T20:11:03.9362342Z U at::in_parallel_region() 2025-05-07T20:11:03.9362662Z U at::init_num_threads() 2025-05-07T20:11:03.9363103Z U at::internal::set_thread_num(int) 2025-05-07T20:11:03.9363460Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:03.9363907Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:03.9364397Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:03.9364753Z U c10::AnyType::get() 2025-05-07T20:11:03.9365186Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.9365614Z U c10::BoolType::get() 2025-05-07T20:11:03.9366014Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:03.9366490Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:03.9366905Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:03.9367676Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, 
c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:03.9368983Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:03.9370030Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.9370635Z U c10::Error::what() const 2025-05-07T20:11:03.9370931Z U c10::FloatType::get() 2025-05-07T20:11:03.9371264Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:03.9371582Z U c10::GradMode::is_enabled() 2025-05-07T20:11:03.9371913Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:03.9372293Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.9372712Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.9373160Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:03.9373531Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:03.9373878Z U c10::IValue::isBoolList() const 2025-05-07T20:11:03.9374213Z U c10::IValue::isIntList() const 2025-05-07T20:11:03.9374531Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:03.9374881Z U c10::IValue::isTensorList() const 2025-05-07T20:11:03.9375232Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:03.9375612Z U c10::IntType::get() 2025-05-07T20:11:03.9375977Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:03.9376358Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:03.9376718Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.9377064Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:03.9377516Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.9377964Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:03.9378319Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:03.9378817Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:03.9379324Z U 
c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.9379697Z U c10::StringType::get() 2025-05-07T20:11:03.9380033Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:03.9380439Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:03.9381090Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:03.9381728Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:03.9382102Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:03.9382419Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:03.9382764Z U c10::SymIntType::get() 2025-05-07T20:11:03.9383132Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:03.9383499Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:03.9383885Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:03.9384231Z U c10::TensorType::get() 2025-05-07T20:11:03.9384569Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:03.9385453Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:03.9386375Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:03.9386744Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:03.9387078Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:03.9387430Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:03.9387755Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:03.9388112Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:03.9388551Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:03.9388980Z U c10::cuda::device_count() 2025-05-07T20:11:03.9389299Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:03.9389645Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:03.9390011Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:03.9390380Z U c10::cuda::getStreamFromPool(int, signed 
char) 2025-05-07T20:11:03.9390917Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:03.9391304Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:03.9391932Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:03.9392963Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:03.9394036Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:03.9394889Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.9395828Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:03.9396931Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:03.9397727Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:03.9398060Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:03.9398605Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:03.9399207Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:03.9399653Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:03.9400068Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:03.9400468Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:03.9400838Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.9401213Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:03.9401873Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:03.9402475Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:03.9402849Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:03.9403230Z U 
c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:03.9403648Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:03.9404069Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:03.9404596Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:11:03.9404944Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:03.9405289Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:03.9405634Z U c10::throwNullDataPtrError() 2025-05-07T20:11:03.9405960Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:03.9406278Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:03.9406692Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:03.9407133Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:03.9407488Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:03.9407843Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:03.9408211Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:03.9408571Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:03.9408910Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:03.9409271Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:03.9409618Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:03.9410072Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:03.9410530Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:03.9410907Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:03.9411267Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:03.9411621Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:03.9411963Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:03.9412320Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:03.9412676Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:03.9413030Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:03.9414023Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, 
long, long, bool, bool, bool, bool) 2025-05-07T20:11:03.9415205Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:11:03.9415752Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:03.9416174Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:03.9416602Z U float at::Tensor::item() const 2025-05-07T20:11:03.9416971Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.9417387Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.9417738Z U free@GLIBC_2.2.5 2025-05-07T20:11:03.9418067Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.9418437Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.9418869Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:03.9419304Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:03.9419706Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:03.9420102Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:03.9420383Z U memcpy@GLIBC_2.14 2025-05-07T20:11:03.9420697Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:03.9420974Z U memset@GLIBC_2.2.5 2025-05-07T20:11:03.9421373Z U omp_get_num_threads 2025-05-07T20:11:03.9421637Z U omp_get_thread_num 2025-05-07T20:11:03.9421953Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:03.9422314Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:03.9422820Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9423503Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9424170Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9424860Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9425606Z U 
radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9426285Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:03.9426774Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:03.9427358Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:11:03.9428249Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:11:03.9428814Z U sqrt@GLIBC_2.2.5 2025-05-07T20:11:03.9429071Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:03.9429440Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:03.9430040Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:03.9430812Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:03.9431622Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:03.9432360Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:03.9432940Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:03.9433313Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:03.9433655Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:03.9433979Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:03.9434302Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:03.9434673Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.9435029Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.9435421Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:03.9435811Z U 
std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:03.9436227Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:03.9436907Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:03.9437676Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:03.9438693Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.9439881Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:03.9440618Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:03.9440962Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:03.9441322Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:03.9441663Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:03.9442014Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.9442353Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:03.9442682Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:03.9443017Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:03.9443442Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.9443979Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:03.9444457Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:03.9444850Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:03.9445268Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:03.9445726Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:03.9446632Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:03.9447309Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:03.9447659Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:03.9447976Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:03.9448256Z U 
strlen@GLIBC_2.2.5 2025-05-07T20:11:03.9448572Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:03.9449432Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:03.9450580Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.9451399Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:03.9451879Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:03.9452408Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:03.9452993Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:03.9453475Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:03.9453981Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:03.9454610Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:03.9455216Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:03.9455668Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:03.9456175Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:03.9456594Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:03.9456963Z U torch::autograd::Node::metadata() 2025-05-07T20:11:03.9457327Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:03.9457815Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:03.9458532Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:03.9459151Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:03.9459564Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:03.9460071Z U 
torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:03.9463260Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:03.9466176Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:03.9466593Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:03.9467011Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:03.9467444Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:03.9468119Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:03.9468983Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:03.9469994Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:03.9470780Z U typeinfo for c10::Error 2025-05-07T20:11:03.9471118Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.9471495Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:03.9471849Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:03.9472222Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:03.9472581Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:03.9473833Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:03.9476088Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int 
const*, long) 2025-05-07T20:11:03.9477444Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:03.9477866Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:03.9478343Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:03.9478774Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:03.9479144Z U vtable for c10::Error 2025-05-07T20:11:03.9479707Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.9480489Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.9481270Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:03.9481861Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:03.9482293Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:03.9482837Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:03.9483280Z U vtable for torch::autograd::Node 2025-05-07T20:11:03.9483679Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:03.9484111Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:03.9484431Z w _ITM_registerTMCloneTable 2025-05-07T20:11:03.9484762Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:03.9485055Z w __gmon_start__ 2025-05-07T20:11:03.9485350Z w __pthread_key_create 2025-05-07T20:11:03.9485658Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:03.9486005Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:03.9486375Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:03.9486892Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.9487249Z 2025-05-07T20:11:03.9487421Z linux-vdso.so.1 (0x00007fffb132d000) 2025-05-07T20:11:03.9487720Z libc10.so => not found 2025-05-07T20:11:03.9488004Z libc10_cuda.so => not found 2025-05-07T20:11:03.9488676Z fbgemm_gpu_tbe_common.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f1f9c000000) 2025-05-07T20:11:03.9489804Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f1f9be50000) 2025-05-07T20:11:03.9490603Z libtorch.so => not found 2025-05-07T20:11:03.9491123Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f1f9b800000) 2025-05-07T20:11:03.9492063Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f1f9a600000) 2025-05-07T20:11:03.9492746Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9493023Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9493316Z libcudart.so.12 => not found 2025-05-07T20:11:03.9493660Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1f9a39c000) 2025-05-07T20:11:03.9494078Z libm.so.6 => /lib64/libm.so.6 (0x00007f1fd65be000) 2025-05-07T20:11:03.9494471Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f1fd6590000) 2025-05-07T20:11:03.9494863Z libc.so.6 => /lib64/libc.so.6 (0x00007f1f9a194000) 2025-05-07T20:11:03.9495248Z /lib64/ld-linux-x86-64.so.2 (0x00007f1fd66a1000) 2025-05-07T20:11:03.9495579Z libc10.so => not found 2025-05-07T20:11:03.9495838Z libc10_cuda.so => not found 2025-05-07T20:11:03.9496460Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f1fd6584000) 2025-05-07T20:11:03.9497127Z libtorch.so => not found 2025-05-07T20:11:03.9497386Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9497704Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9497977Z libcudart.so.12 => not found 2025-05-07T20:11:03.9498268Z libc10.so => not found 2025-05-07T20:11:03.9498514Z libc10_cuda.so => not found 2025-05-07T20:11:03.9498826Z libtorch.so => not found 2025-05-07T20:11:03.9499105Z libtorch_cpu.so => not found 
2025-05-07T20:11:03.9499372Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9499655Z libcudart.so.12 => not found 2025-05-07T20:11:03.9499909Z libc10.so => not found 2025-05-07T20:11:03.9500439Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f1f9c385000) 2025-05-07T20:11:03.9501000Z libtorch.so => not found 2025-05-07T20:11:03.9501277Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9501554Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9501838Z libtorch.so => not found 2025-05-07T20:11:03.9502111Z libc10.so => not found 2025-05-07T20:11:03.9502359Z libc10_cuda.so => not found 2025-05-07T20:11:03.9502645Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9502918Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9503204Z libcudart.so.12 => not found 2025-05-07T20:11:03.9503471Z libc10.so => not found 2025-05-07T20:11:03.9503886Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9504164Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9504439Z libtorch.so => not found 2025-05-07T20:11:03.9504741Z libtorch_cpu.so => not found 2025-05-07T20:11:03.9505030Z libtorch_cuda.so => not found 2025-05-07T20:11:03.9505299Z libtorch.so => not found 2025-05-07T20:11:03.9505457Z 2025-05-07T20:11:03.9505579Z [CHECK] Displaying ELF information: 2025-05-07T20:11:03.9506083Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:03.9506462Z 2025-05-07T20:11:03.9506467Z 2025-05-07T20:11:03.9506635Z Dynamic section at offset 0x38b44998 contains 43 entries: 2025-05-07T20:11:03.9507162Z Tag Type Name/Value 2025-05-07T20:11:03.9507580Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:03.9508103Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:03.9508656Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:03.9509228Z 0x0000000000000001 (NEEDED) Shared library: 
[fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:03.9509811Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:03.9510300Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:03.9510864Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:03.9511390Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:03.9511931Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:03.9512474Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:03.9512991Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:03.9513514Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:03.9514010Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:03.9514528Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:03.9515065Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:03.9515650Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:03.9516300Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:03.9516887Z 0x000000000000000c (INIT) 0x611000 2025-05-07T20:11:03.9517342Z 0x000000000000000d (FINI) 0x32390cc 2025-05-07T20:11:03.9517712Z 0x0000000000000019 (INIT_ARRAY) 0x38b425f8 2025-05-07T20:11:03.9518156Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:11:03.9518557Z 0x000000000000001a (FINI_ARRAY) 0x38b42d18 2025-05-07T20:11:03.9518923Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:03.9520437Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:03.9520794Z 0x000000006ffffef5 (GNU_HASH) 0x10330 2025-05-07T20:11:03.9521178Z 0x0000000000000005 (STRTAB) 0x69580 2025-05-07T20:11:03.9521527Z 0x0000000000000006 (SYMTAB) 0x20fb0 2025-05-07T20:11:03.9521918Z 0x000000000000000a (STRSZ) 4919620 (bytes) 2025-05-07T20:11:03.9522301Z 
0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:03.9522689Z 0x0000000000000003 (PLTGOT) 0x38b44c88 2025-05-07T20:11:03.9523093Z 0x0000000000000002 (PLTRELSZ) 50064 (bytes) 2025-05-07T20:11:03.9523458Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:03.9523825Z 0x0000000000000017 (JMPREL) 0x603da0 2025-05-07T20:11:03.9524174Z 0x0000000000000007 (RELA) 0x5208e0 2025-05-07T20:11:03.9524561Z 0x0000000000000008 (RELASZ) 931008 (bytes) 2025-05-07T20:11:03.9524938Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:03.9525297Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:03.9525635Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:03.9526059Z 0x000000006ffffffe (VERNEED) 0x520740 2025-05-07T20:11:03.9526428Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:03.9526767Z 0x000000006ffffff0 (VERSYM) 0x51a6c4 2025-05-07T20:11:03.9527144Z 0x000000006ffffff9 (RELACOUNT) 26208 2025-05-07T20:11:03.9527480Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:03.9527717Z 2025-05-07T20:11:03.9527841Z ################################################################################ 2025-05-07T20:11:03.9528079Z 2025-05-07T20:11:03.9528083Z 2025-05-07T20:11:03.9528227Z ################################################################################ 2025-05-07T20:11:03.9528896Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:03.9529457Z [CHECK] Listing out library size: 2025-05-07T20:11:03.9529957Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:03.9530398Z 2025-05-07T20:11:03.9530645Z 142 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:03.9531003Z 2025-05-07T20:11:03.9531457Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:03.9532559Z + objdump -TC 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.9533384Z 2025-05-07T20:11:03.9753889Z GLIBC_2.2.5 2025-05-07T20:11:03.9754156Z GLIBC_2.3 2025-05-07T20:11:03.9754405Z GLIBC_2.14 2025-05-07T20:11:03.9754549Z 2025-05-07T20:11:03.9754555Z 2025-05-07T20:11:03.9755061Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:03.9756308Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:03.9757010Z 2025-05-07T20:11:04.0038910Z GLIBCXX_3.4 2025-05-07T20:11:04.0039576Z GLIBCXX_3.4.9 2025-05-07T20:11:04.0040211Z GLIBCXX_3.4.11 2025-05-07T20:11:04.0040855Z GLIBCXX_3.4.18 2025-05-07T20:11:04.0041444Z GLIBCXX_3.4.20 2025-05-07T20:11:04.0042052Z GLIBCXX_3.4.21 2025-05-07T20:11:04.0042630Z GLIBCXX_3.4.29 2025-05-07T20:11:04.0042994Z 2025-05-07T20:11:04.0043035Z 2025-05-07T20:11:04.0068603Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.eqj5pEE9je.symbols.txt 2025-05-07T20:11:04.0070182Z 2025-05-07T20:11:04.0307055Z 2025-05-07T20:11:04.0334670Z [CHECK] Total Number of symbols: 1624 2025-05-07T20:11:04.0354182Z [CHECK] Number of fbgemm symbols: 228 2025-05-07T20:11:04.0374963Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.HAGe1vaUnP.usymbols.txt 2025-05-07T20:11:04.0375543Z 2025-05-07T20:11:04.0398090Z 2025-05-07T20:11:04.0431578Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:04.0453869Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.0455752Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 
2025-05-07T20:11:04.0456371Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.0456760Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.0457173Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.0457601Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.0457990Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.0458394Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.0458777Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.0459296Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.0459682Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.0460018Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.0460379Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.0460702Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.0461055Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:04.0461400Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.0461761Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.0462125Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:04.0462532Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:04.0462994Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:04.0463437Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:04.0463928Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:04.0464802Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.0466181Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.0467155Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:04.0467894Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:04.0468768Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, 
std::optional, std::optional) 2025-05-07T20:11:04.0470031Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.0470843Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:04.0471231Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:04.0471573Z U at::globalContext() 2025-05-07T20:11:04.0471976Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.0472417Z U c10::BoolType::get() 2025-05-07T20:11:04.0472777Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.0473170Z U c10::FloatType::get() 2025-05-07T20:11:04.0473502Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:04.0473881Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.0474305Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.0474827Z U c10::IntType::get() 2025-05-07T20:11:04.0475204Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.0475620Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.0476071Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.0476508Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.0477132Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:04.0477852Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.0478549Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.0478936Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:04.0479336Z U c10::SymIntType::get() 2025-05-07T20:11:04.0479725Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.0480160Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.0480553Z U c10::TensorType::get() 2025-05-07T20:11:04.0480890Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.0481854Z U c10::Warning::Warning(std::variant, c10::SourceLocation 
const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.0482836Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.0483213Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.0483590Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.0483941Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.0484313Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.0484664Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.0485194Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.0485690Z U c10::cuda::device_count() 2025-05-07T20:11:04.0486045Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.0486457Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.0486858Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.0487276Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.0487713Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.0488112Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.0488882Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.0489767Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.0490658Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.0491617Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.0492686Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.0493543Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.0494021Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.0494395Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 
2025-05-07T20:11:04.0494836Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.0495236Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.0495625Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.0495770Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.0495911Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.0496024Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.0496144Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.0496350Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.0496478Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.0496615Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.0496779Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.0496941Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.0497068Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.0497201Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.0497344Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.0497466Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.0497594Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.0497742Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:04.0497891Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:04.0498023Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.0498149Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.0498292Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.0498413Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.0498552Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.0498698Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.0500927Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:04.0501143Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:04.0501263Z U float at::Tensor::item() const 2025-05-07T20:11:04.0501404Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.0501579Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.0501707Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.0501843Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.0502033Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.0502168Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.0502332Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.0502453Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.0502551Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.0502715Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.0502864Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.0503015Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.0503494Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.0503805Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.0504159Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.0504483Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.0504830Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.0505240Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.0505606Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.0505998Z U 
std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:04.0506130Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.0506257Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:04.0506414Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.0506578Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.0506762Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.0506905Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.0507179Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.0507525Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.0508110Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.0508818Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.0508946Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.0509104Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.0509238Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.0509372Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.0509515Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.0509645Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.0509842Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.0510110Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.0510253Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.0510377Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.0510492Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.0510678Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.0511303Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, 
std::optional, char const*, unsigned int) 2025-05-07T20:11:04.0511792Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.0512058Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.0512437Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.0513100Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:04.0514526Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0516076Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0517439Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0518816Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0520192Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0521560Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.0523557Z U void 
embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0525537Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0527443Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0529411Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0531164Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0532914Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.0534536Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 
2025-05-07T20:11:04.0534689Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.0534852Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.0535022Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.0535352Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.0535666Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.0536034Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.0536229Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.0536470Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.0536606Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.0536718Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.0536824Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.0536942Z w __gmon_start__ 2025-05-07T20:11:04.0537044Z w __pthread_key_create 2025-05-07T20:11:04.0537330Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.0537451Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.0537628Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.0537885Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:04.0537891Z 2025-05-07T20:11:04.0538055Z linux-vdso.so.1 (0x00007ffffcbd9000) 2025-05-07T20:11:04.0538308Z libc10.so => not found 2025-05-07T20:11:04.0538407Z libc10_cuda.so => not found 2025-05-07T20:11:04.0538973Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fc559800000) 2025-05-07T20:11:04.0539128Z libtorch.so => not found 2025-05-07T20:11:04.0539230Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0539330Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0539462Z libcudart.so.12 => not found 2025-05-07T20:11:04.0539625Z libstdc++.so.6 => 
/lib64/libstdc++.so.6 (0x00007fc55959c000) 2025-05-07T20:11:04.0539782Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc59cbca000) 2025-05-07T20:11:04.0539931Z libc.so.6 => /lib64/libc.so.6 (0x00007fc559394000) 2025-05-07T20:11:04.0540066Z /lib64/ld-linux-x86-64.so.2 (0x00007fc59cbfe000) 2025-05-07T20:11:04.0540166Z libc10.so => not found 2025-05-07T20:11:04.0540269Z libc10_cuda.so => not found 2025-05-07T20:11:04.0540754Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fc559000000) 2025-05-07T20:11:04.0541290Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fc558e50000) 2025-05-07T20:11:04.0541391Z libtorch.so => not found 2025-05-07T20:11:04.0541759Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fc558800000) 2025-05-07T20:11:04.0542233Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc557600000) 2025-05-07T20:11:04.0542336Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0542463Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0542563Z libcudart.so.12 => not found 2025-05-07T20:11:04.0542697Z libm.so.6 => /lib64/libm.so.6 (0x00007fc59caeb000) 2025-05-07T20:11:04.0542818Z libc10.so => not found 2025-05-07T20:11:04.0542917Z libc10_cuda.so => not found 2025-05-07T20:11:04.0543350Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fc59cadd000) 2025-05-07T20:11:04.0543455Z libtorch.so => not found 2025-05-07T20:11:04.0543577Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0543680Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0543783Z libcudart.so.12 => not found 2025-05-07T20:11:04.0543899Z libc10.so => not found 2025-05-07T20:11:04.0543995Z libc10_cuda.so => not 
found 2025-05-07T20:11:04.0544098Z libtorch.so => not found 2025-05-07T20:11:04.0544196Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0544316Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0544412Z libcudart.so.12 => not found 2025-05-07T20:11:04.0544506Z libc10.so => not found 2025-05-07T20:11:04.0544914Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fc59ca5e000) 2025-05-07T20:11:04.0545010Z libtorch.so => not found 2025-05-07T20:11:04.0545111Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0545216Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0545359Z libtorch.so => not found 2025-05-07T20:11:04.0545456Z libc10.so => not found 2025-05-07T20:11:04.0545552Z libc10_cuda.so => not found 2025-05-07T20:11:04.0545673Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0545769Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0545867Z libcudart.so.12 => not found 2025-05-07T20:11:04.0545961Z libc10.so => not found 2025-05-07T20:11:04.0546081Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0546182Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0546282Z libtorch.so => not found 2025-05-07T20:11:04.0546551Z libtorch_cpu.so => not found 2025-05-07T20:11:04.0546653Z libtorch_cuda.so => not found 2025-05-07T20:11:04.0546748Z libtorch.so => not found 2025-05-07T20:11:04.0546753Z 2025-05-07T20:11:04.0547056Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.0547358Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:04.0547362Z 2025-05-07T20:11:04.0551290Z 2025-05-07T20:11:04.0551478Z Dynamic section at offset 0x8dbfdd8 contains 39 entries: 2025-05-07T20:11:04.0551632Z Tag Type Name/Value 2025-05-07T20:11:04.0551932Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.0552141Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.0552427Z 0x0000000000000001 (NEEDED) Shared library: 
[fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:04.0552632Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.0552842Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.0553076Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.0553295Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.0553508Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.0553737Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.0553934Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.0554158Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.0554442Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:11:04.0554691Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.0554814Z 0x000000000000000c (INIT) 0xbf000 2025-05-07T20:11:04.0554938Z 0x000000000000000d (FINI) 0x62dd0c 2025-05-07T20:11:04.0555085Z 0x0000000000000019 (INIT_ARRAY) 0x8dbf998 2025-05-07T20:11:04.0555225Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:11:04.0555353Z 0x000000000000001a (FINI_ARRAY) 0x8dbfa60 2025-05-07T20:11:04.0555504Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.0555621Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.0555746Z 0x000000006ffffef5 (GNU_HASH) 0x2b38 2025-05-07T20:11:04.0555868Z 0x0000000000000005 (STRTAB) 0xedf0 2025-05-07T20:11:04.0556069Z 0x0000000000000006 (SYMTAB) 0x5598 2025-05-07T20:11:04.0556238Z 0x000000000000000a (STRSZ) 594745 (bytes) 2025-05-07T20:11:04.0556368Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.0556513Z 0x0000000000000003 (PLTGOT) 0x8dc0088 2025-05-07T20:11:04.0556654Z 0x0000000000000002 (PLTRELSZ) 11400 (bytes) 2025-05-07T20:11:04.0556774Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.0556905Z 
0x0000000000000017 (JMPREL) 0xbb9f8 2025-05-07T20:11:04.0557085Z 0x0000000000000007 (RELA) 0xa0f20 2025-05-07T20:11:04.0557230Z 0x0000000000000008 (RELASZ) 109272 (bytes) 2025-05-07T20:11:04.0557360Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.0557524Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.0557660Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.0557788Z 0x000000006ffffffe (VERNEED) 0xa0de0 2025-05-07T20:11:04.0557928Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.0558050Z 0x000000006ffffff0 (VERSYM) 0xa012a 2025-05-07T20:11:04.0558176Z 0x000000006ffffff9 (RELACOUNT) 3126 2025-05-07T20:11:04.0558309Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.0558314Z 2025-05-07T20:11:04.0558438Z ################################################################################ 2025-05-07T20:11:04.0558443Z 2025-05-07T20:11:04.0558446Z 2025-05-07T20:11:04.0558564Z ################################################################################ 2025-05-07T20:11:04.0558929Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.0559045Z [CHECK] Listing out library size: 2025-05-07T20:11:04.0559377Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.0559408Z 2025-05-07T20:11:04.0566256Z 59 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.0567356Z 2025-05-07T20:11:04.0568521Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.0569110Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.0569115Z 2025-05-07T20:11:04.0726303Z GLIBC_2.2.5 2025-05-07T20:11:04.0726455Z GLIBC_2.3 
2025-05-07T20:11:04.0726575Z GLIBC_2.14 2025-05-07T20:11:04.0731851Z 2025-05-07T20:11:04.0731947Z 2025-05-07T20:11:04.0732515Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.0733112Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.0733124Z 2025-05-07T20:11:04.0884102Z GLIBCXX_3.4 2025-05-07T20:11:04.0884826Z GLIBCXX_3.4.9 2025-05-07T20:11:04.0884941Z GLIBCXX_3.4.11 2025-05-07T20:11:04.0885220Z GLIBCXX_3.4.15 2025-05-07T20:11:04.0885359Z GLIBCXX_3.4.18 2025-05-07T20:11:04.0885457Z GLIBCXX_3.4.20 2025-05-07T20:11:04.0885565Z GLIBCXX_3.4.21 2025-05-07T20:11:04.0885698Z GLIBCXX_3.4.29 2025-05-07T20:11:04.0888931Z 2025-05-07T20:11:04.0888936Z 2025-05-07T20:11:04.0910179Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.WdV4NNjnre.symbols.txt 2025-05-07T20:11:04.0910505Z 2025-05-07T20:11:04.1033106Z 2025-05-07T20:11:04.1057579Z [CHECK] Total Number of symbols: 1791 2025-05-07T20:11:04.1078032Z [CHECK] Number of fbgemm symbols: 94 2025-05-07T20:11:04.1098186Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.3yYAEOVa33.usymbols.txt 2025-05-07T20:11:04.1098280Z 2025-05-07T20:11:04.1123323Z 2025-05-07T20:11:04.1158107Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:11:04.1175075Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.1175540Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.1175669Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.1175831Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.1176140Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.1176302Z U 
__cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.1176504Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.1176641Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.1176804Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.1176955Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.1177084Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:04.1177232Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.1177356Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.1177471Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.1177594Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:04.1177740Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:04.1177857Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:04.1177978Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:04.1178117Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.1178242Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:04.1178354Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:04.1178504Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.1178660Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.1178784Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:04.1178946Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:04.1179162Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:04.1179303Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:04.1179425Z U at::RecordFunction::end() 2025-05-07T20:11:04.1179590Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:04.1179748Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:04.1179942Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:04.1180145Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:04.1180732Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.1181378Z U 
at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.1181731Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:04.1182199Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.1182880Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.1183014Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:04.1183141Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:04.1183317Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:04.1183426Z U at::globalContext() 2025-05-07T20:11:04.1183554Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:04.1183678Z U c10::AnyType::get() 2025-05-07T20:11:04.1183876Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.1183978Z U c10::BoolType::get() 2025-05-07T20:11:04.1184166Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.1184362Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:04.1184506Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:04.1184991Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:04.1185597Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:04.1185951Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:04.1186079Z U c10::Error::what() const 2025-05-07T20:11:04.1186181Z U c10::FloatType::get() 2025-05-07T20:11:04.1186294Z U c10::GradMode::is_enabled() 2025-05-07T20:11:04.1186410Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:04.1186600Z U c10::Half* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.1186756Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:04.1186872Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:04.1187036Z U c10::IValue::isBoolList() const 2025-05-07T20:11:04.1187146Z U c10::IValue::isIntList() const 2025-05-07T20:11:04.1187268Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:04.1187409Z U c10::IValue::isTensorList() const 2025-05-07T20:11:04.1187551Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.1187657Z U c10::IntType::get() 2025-05-07T20:11:04.1187838Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.1187961Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.1188090Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:04.1188241Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:04.1188455Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:04.1188721Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:04.1188897Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.1189052Z U c10::StringType::get() 2025-05-07T20:11:04.1189197Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:04.1189362Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.1189532Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:04.1189682Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:04.1189837Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:04.1190240Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.1190379Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.1190541Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:04.1190683Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:04.1190809Z U c10::SymInt::promote_to_negative() 
2025-05-07T20:11:04.1190945Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:04.1191100Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:04.1191233Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:04.1191370Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:04.1191501Z U c10::SymIntType::get() 2025-05-07T20:11:04.1191653Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.1191802Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:04.1191980Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.1192090Z U c10::TensorType::get() 2025-05-07T20:11:04.1192216Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.1192902Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.1193040Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.1193161Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.1193308Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.1193426Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.1193549Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.1193692Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.1193938Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.1194075Z U c10::cuda::device_count() 2025-05-07T20:11:04.1194236Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.1194373Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.1194517Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.1194662Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.1194838Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.1194959Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.1195368Z U c10::detail::ListImpl::ListImpl(std::vector >, 
c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:04.1195873Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.1196230Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.1196955Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.1197307Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.1197893Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.1198045Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.1198165Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.1198493Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:04.1198708Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:04.1198865Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:04.1199040Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.1199186Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.1199314Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:04.1199477Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:04.1199903Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:04.1200059Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:04.1200213Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.1200376Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.1200548Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:04.1200706Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:04.1200878Z 
U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:04.1201004Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.1201122Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.1201262Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.1201464Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.1201594Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.1201736Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.1201892Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.1202034Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.1202185Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.1202336Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.1202463Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.1202589Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.1202742Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.1202870Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:04.1203014Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:04.1203147Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.1203292Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.1203417Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.1203543Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.1203699Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.1203834Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.1206030Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:04.1206326Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:04.1206480Z U float* at::TensorBase::data_ptr() const 
2025-05-07T20:11:04.1206669Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.1206781Z U free@GLIBC_2.2.5 2025-05-07T20:11:04.1206918Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.1207099Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.1207288Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.1207428Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.1207611Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.1207727Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:04.1207853Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.1207960Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.1208086Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.1208267Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.1208397Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.1222867Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.1223296Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.1223427Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:04.1223741Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:04.1224289Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.1224687Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.1225033Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.1225476Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:04.1225865Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.1226269Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) 
const@GLIBCXX_3.4.18 2025-05-07T20:11:04.1226402Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.1226527Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:04.1226710Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.1226862Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.1227052Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.1227216Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.1227375Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:04.1227625Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.1228058Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.1228641Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.1229159Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.1229323Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:04.1229454Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.1229588Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.1229738Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.1229869Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.1229993Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.1230141Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.1230334Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.1230616Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.1230778Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.1230986Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:04.1231132Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:04.1231592Z U 
std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:04.1231743Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:04.1231868Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.1231996Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:04.1232103Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.1232244Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.1232864Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.1233336Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.1233605Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.1233790Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:04.1234093Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:04.1234353Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:04.1234568Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:04.1234763Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:04.1235138Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:04.1235299Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:04.1235496Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:04.1235704Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:04.1235833Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:04.1235958Z U torch::autograd::Node::metadata() 2025-05-07T20:11:04.1236238Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:04.1236513Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 
2025-05-07T20:11:04.1236790Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:04.1236966Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:04.1237220Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:04.1237446Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:04.1240140Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:04.1240339Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:04.1240516Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:04.1240692Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:04.1241490Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:04.1241663Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:04.1242079Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:04.1242468Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.1243021Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:04.1243164Z U typeinfo for c10::Error 2025-05-07T20:11:04.1243332Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:04.1243467Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:04.1243609Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:04.1243772Z U typeinfo for 
std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:04.1243902Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:04.1245322Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1246945Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1248367Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1249755Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1251128Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1252553Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:04.1252745Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.1252923Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.1253090Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:04.1253282Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.1253399Z U vtable for c10::Error 2025-05-07T20:11:04.1253757Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.1254122Z U vtable for 
std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.1254475Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.1254620Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:04.1254882Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.1255121Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.1255249Z U vtable for torch::autograd::Node 2025-05-07T20:11:04.1255455Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:04.1255577Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.1255698Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.1255816Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.1255942Z w __gmon_start__ 2025-05-07T20:11:04.1256050Z w __pthread_key_create 2025-05-07T20:11:04.1256175Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.1256318Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.1256473Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.1256751Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.1256758Z 2025-05-07T20:11:04.1256932Z linux-vdso.so.1 (0x00007fff73bd2000) 2025-05-07T20:11:04.1257031Z libc10.so => not found 2025-05-07T20:11:04.1257139Z libc10_cuda.so => not found 2025-05-07T20:11:04.1257715Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007facaac00000) 2025-05-07T20:11:04.1257837Z libtorch.so => not found 2025-05-07T20:11:04.1257949Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1258061Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1258192Z libcudart.so.12 => not found 2025-05-07T20:11:04.1258367Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007facaa99c000) 2025-05-07T20:11:04.1258528Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007face8ad2000) 2025-05-07T20:11:04.1258687Z libc.so.6 => 
/lib64/libc.so.6 (0x00007facaa794000) 2025-05-07T20:11:04.1258821Z /lib64/ld-linux-x86-64.so.2 (0x00007face8b06000) 2025-05-07T20:11:04.1258923Z libc10.so => not found 2025-05-07T20:11:04.1259095Z libc10_cuda.so => not found 2025-05-07T20:11:04.1259596Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007facaa400000) 2025-05-07T20:11:04.1260142Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007facaa250000) 2025-05-07T20:11:04.1260245Z libtorch.so => not found 2025-05-07T20:11:04.1260681Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007faca9c00000) 2025-05-07T20:11:04.1261169Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007faca8a00000) 2025-05-07T20:11:04.1261277Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1261383Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1261506Z libcudart.so.12 => not found 2025-05-07T20:11:04.1261646Z libm.so.6 => /lib64/libm.so.6 (0x00007facaa175000) 2025-05-07T20:11:04.1261744Z libc10.so => not found 2025-05-07T20:11:04.1261864Z libc10_cuda.so => not found 2025-05-07T20:11:04.1262304Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007face8ac0000) 2025-05-07T20:11:04.1262406Z libtorch.so => not found 2025-05-07T20:11:04.1262530Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1262638Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1262741Z libcudart.so.12 => not found 2025-05-07T20:11:04.1262833Z libc10.so => not found 2025-05-07T20:11:04.1262954Z libc10_cuda.so => not found 2025-05-07T20:11:04.1263056Z libtorch.so => not found 2025-05-07T20:11:04.1263160Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1263283Z libtorch_cuda.so => not found 
2025-05-07T20:11:04.1263407Z libcudart.so.12 => not found 2025-05-07T20:11:04.1263509Z libc10.so => not found 2025-05-07T20:11:04.1263873Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007face8a41000) 2025-05-07T20:11:04.1263999Z libtorch.so => not found 2025-05-07T20:11:04.1264107Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1264211Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1264333Z libtorch.so => not found 2025-05-07T20:11:04.1264436Z libc10.so => not found 2025-05-07T20:11:04.1264534Z libc10_cuda.so => not found 2025-05-07T20:11:04.1264642Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1264771Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1264872Z libcudart.so.12 => not found 2025-05-07T20:11:04.1264975Z libc10.so => not found 2025-05-07T20:11:04.1265099Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1265203Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1265303Z libtorch.so => not found 2025-05-07T20:11:04.1265405Z libtorch_cpu.so => not found 2025-05-07T20:11:04.1265530Z libtorch_cuda.so => not found 2025-05-07T20:11:04.1265625Z libtorch.so => not found 2025-05-07T20:11:04.1265630Z 2025-05-07T20:11:04.1265745Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.1266087Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:04.1266092Z 2025-05-07T20:11:04.1266125Z 2025-05-07T20:11:04.1266294Z Dynamic section at offset 0x3a22e50 contains 39 entries: 2025-05-07T20:11:04.1266422Z Tag Type Name/Value 2025-05-07T20:11:04.1266653Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.1266857Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.1267124Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:04.1267345Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.1267554Z 0x0000000000000001 (NEEDED) 
Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.1267766Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.1267996Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.1268202Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.1268405Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.1268627Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.1268866Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.1269150Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:11:04.1269384Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.1269506Z 0x000000000000000c (INIT) 0x7a000 2025-05-07T20:11:04.1269631Z 0x000000000000000d (FINI) 0x26a70c 2025-05-07T20:11:04.1269758Z 0x0000000000000019 (INIT_ARRAY) 0x3a23350 2025-05-07T20:11:04.1269918Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:11:04.1270048Z 0x000000000000001a (FINI_ARRAY) 0x3a23408 2025-05-07T20:11:04.1270176Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.1270312Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.1270442Z 0x000000006ffffef5 (GNU_HASH) 0x2e00 2025-05-07T20:11:04.1270558Z 0x0000000000000005 (STRTAB) 0x101c8 2025-05-07T20:11:04.1270679Z 0x0000000000000006 (SYMTAB) 0x59c8 2025-05-07T20:11:04.1270850Z 0x000000000000000a (STRSZ) 353759 (bytes) 2025-05-07T20:11:04.1270981Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.1271108Z 0x0000000000000003 (PLTGOT) 0x3a24100 2025-05-07T20:11:04.1271272Z 0x0000000000000002 (PLTRELSZ) 13056 (bytes) 2025-05-07T20:11:04.1271413Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.1271536Z 0x0000000000000017 (JMPREL) 0x75e68 2025-05-07T20:11:04.1271675Z 0x0000000000000007 (RELA) 0x67708 2025-05-07T20:11:04.1271818Z 0x0000000000000008 (RELASZ) 59232 (bytes) 
2025-05-07T20:11:04.1271945Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.1272056Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.1272214Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.1272340Z 0x000000006ffffffe (VERNEED) 0x675a8 2025-05-07T20:11:04.1272464Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.1272718Z 0x000000006ffffff0 (VERSYM) 0x667a8 2025-05-07T20:11:04.1272835Z 0x000000006ffffff9 (RELACOUNT) 1167 2025-05-07T20:11:04.1272945Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.1272949Z 2025-05-07T20:11:04.1273092Z ################################################################################ 2025-05-07T20:11:04.1273099Z 2025-05-07T20:11:04.1273102Z 2025-05-07T20:11:04.1273221Z ################################################################################ 2025-05-07T20:11:04.1273570Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.1273706Z [CHECK] Listing out library size: 2025-05-07T20:11:04.1274178Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.1274182Z 2025-05-07T20:11:04.1275034Z 329 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.1275056Z 2025-05-07T20:11:04.1275533Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.1276269Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.1276285Z 2025-05-07T20:11:04.1867640Z GLIBC_2.2.5 2025-05-07T20:11:04.1868497Z GLIBC_2.3 2025-05-07T20:11:04.1868762Z GLIBC_2.14 2025-05-07T20:11:04.1868808Z 2025-05-07T20:11:04.1868822Z 2025-05-07T20:11:04.1870244Z [CHECK] Listing out the GLIBCXX versions referenced by: 
./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.1871968Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:04.1871985Z 2025-05-07T20:11:04.2464650Z GLIBCXX_3.4 2025-05-07T20:11:04.2465372Z GLIBCXX_3.4.9 2025-05-07T20:11:04.2466086Z GLIBCXX_3.4.11 2025-05-07T20:11:04.2466317Z GLIBCXX_3.4.18 2025-05-07T20:11:04.2466574Z GLIBCXX_3.4.20 2025-05-07T20:11:04.2466867Z GLIBCXX_3.4.21 2025-05-07T20:11:04.2467128Z GLIBCXX_3.4.29 2025-05-07T20:11:04.2467266Z 2025-05-07T20:11:04.2467277Z 2025-05-07T20:11:04.2484623Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.0n5gUYaenF.symbols.txt 2025-05-07T20:11:04.2486251Z 2025-05-07T20:11:04.3039854Z 2025-05-07T20:11:04.3074723Z [CHECK] Total Number of symbols: 3670 2025-05-07T20:11:04.3127927Z [CHECK] Number of fbgemm symbols: 456 2025-05-07T20:11:04.3151080Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.iU9ZMvKwQv.usymbols.txt 2025-05-07T20:11:04.3152693Z 2025-05-07T20:11:04.3183034Z 2025-05-07T20:11:04.3209788Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:11:04.3225720Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.3226979Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.3227557Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:04.3228204Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.3228608Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:04.3229016Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.3229405Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:04.3229819Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:04.3230226Z U 
__cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:04.3230613Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:04.3231008Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:04.3231348Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:04.3231714Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:04.3232045Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:04.3232402Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:04.3232893Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:04.3233230Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:04.3233693Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:04.3234139Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:04.3234564Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:04.3234980Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:04.3235441Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:04.3235930Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:04.3237105Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.3238528Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.3239536Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:04.3240175Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:04.3241116Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.3242444Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:04.3243388Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:04.3243825Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:04.3244173Z U 
at::globalContext() 2025-05-07T20:11:04.3244605Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.3245049Z U c10::BoolType::get() 2025-05-07T20:11:04.3245405Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:04.3245787Z U c10::FloatType::get() 2025-05-07T20:11:04.3246103Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:04.3246749Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.3247486Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:04.3247910Z U c10::IntType::get() 2025-05-07T20:11:04.3248310Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:04.3248728Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:04.3249209Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.3249637Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:04.3250061Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:04.3250525Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:04.3250976Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:04.3251680Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:04.3252339Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:04.3252764Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:04.3253166Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:04.3253635Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:04.3254003Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:04.3254361Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:04.3254770Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:04.3255111Z U c10::SymIntType::get() 2025-05-07T20:11:04.3255463Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:04.3255891Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:04.3256259Z U 
c10::TensorType::get() 2025-05-07T20:11:04.3256610Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:04.3257505Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:04.3258433Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:04.3258818Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:04.3259154Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:04.3259509Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:04.3259849Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:04.3260203Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:04.3260676Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:04.3261164Z U c10::cuda::device_count() 2025-05-07T20:11:04.3261524Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:04.3261926Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:04.3262329Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:04.3262736Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:04.3263141Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:04.3263539Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:04.3264237Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:04.3265073Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:04.3265900Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.3266792Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:04.3267774Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:04.3268587Z U 
c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:04.3268916Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:04.3269301Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:04.3269714Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:04.3270128Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:04.3270496Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:04.3270841Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:04.3271223Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:04.3271614Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:04.3272016Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:04.3272379Z U c10::throwNullDataPtrError() 2025-05-07T20:11:04.3272719Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:04.3273066Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:04.3273500Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:04.3273928Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:04.3274265Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:04.3274638Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:04.3274995Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:04.3275372Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:04.3275739Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:04.3276175Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:04.3276719Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:04.3277093Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:04.3277558Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:04.3277944Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:04.3278350Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:04.3278735Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:04.3279098Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:04.3279469Z U 
cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:04.3279883Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:04.3280281Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:04.3282763Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:04.3285241Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:04.3285715Z U float at::Tensor::item() const 2025-05-07T20:11:04.3286121Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.3286555Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.3286992Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.3287417Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.3287889Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:04.3288359Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:04.3288787Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:04.3289160Z U memcpy@GLIBC_2.14 2025-05-07T20:11:04.3289491Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:04.3289794Z U memset@GLIBC_2.2.5 2025-05-07T20:11:04.3290170Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:04.3290567Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:04.3291169Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3292033Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3292719Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3293455Z U radix_sort_pairs(void*, unsigned long&, long 
const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3294226Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3294944Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:04.3295716Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:04.3296505Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:04.3297311Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:04.3298113Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:04.3298680Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:04.3299039Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:04.3299415Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.3299797Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.3300220Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:04.3300682Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:04.3301204Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:04.3301892Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:04.3302866Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.3304022Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:04.3304738Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:04.3305071Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:04.3305410Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.3305732Z U 
std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:04.3306062Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:04.3306429Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:04.3306840Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.3307353Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:04.3307820Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:04.3308151Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:04.3308468Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:04.3308764Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:04.3309534Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:04.3310623Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.3311380Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:04.3312082Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:04.3313076Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:04.3315474Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3319687Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3323864Z U void 
embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3327741Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3331584Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3335446Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:04.3339191Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:04.3341132Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:04.3341568Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:04.3341997Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:04.3342596Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.3343382Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.3344161Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:04.3344793Z 
U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:04.3345338Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:04.3345778Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:04.3346115Z w _ITM_registerTMCloneTable 2025-05-07T20:11:04.3346632Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:04.3347114Z w __gmon_start__ 2025-05-07T20:11:04.3347417Z w __pthread_key_create 2025-05-07T20:11:04.3347813Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:04.3348162Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:04.3348531Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:04.3350803Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.3351195Z 2025-05-07T20:11:04.3351313Z linux-vdso.so.1 (0x00007ffc5ebdd000) 2025-05-07T20:11:04.3351637Z libc10.so => not found 2025-05-07T20:11:04.3351930Z libc10_cuda.so => not found 2025-05-07T20:11:04.3352692Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fbba0a00000) 2025-05-07T20:11:04.3353495Z libtorch.so => not found 2025-05-07T20:11:04.3353763Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3354076Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3354365Z libcudart.so.12 => not found 2025-05-07T20:11:04.3354905Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbba079c000) 2025-05-07T20:11:04.3355339Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbbefda6000) 2025-05-07T20:11:04.3355757Z libc.so.6 => /lib64/libc.so.6 (0x00007fbba0594000) 2025-05-07T20:11:04.3356271Z /lib64/ld-linux-x86-64.so.2 (0x00007fbbefdda000) 2025-05-07T20:11:04.3356616Z libc10.so => not found 2025-05-07T20:11:04.3357048Z libc10_cuda.so => not found 2025-05-07T20:11:04.3357697Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fbba0200000) 2025-05-07T20:11:04.3358850Z 
fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fbba0050000) 2025-05-07T20:11:04.3359635Z libtorch.so => not found 2025-05-07T20:11:04.3360157Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fbb9fa00000) 2025-05-07T20:11:04.3361123Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fbb9e800000) 2025-05-07T20:11:04.3361794Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3362105Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3362391Z libcudart.so.12 => not found 2025-05-07T20:11:04.3362720Z libm.so.6 => /lib64/libm.so.6 (0x00007fbbefcc7000) 2025-05-07T20:11:04.3363055Z libc10.so => not found 2025-05-07T20:11:04.3363321Z libc10_cuda.so => not found 2025-05-07T20:11:04.3363967Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fbbefcb9000) 2025-05-07T20:11:04.3364673Z libtorch.so => not found 2025-05-07T20:11:04.3364965Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3365242Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3365547Z libcudart.so.12 => not found 2025-05-07T20:11:04.3365809Z libc10.so => not found 2025-05-07T20:11:04.3366069Z libc10_cuda.so => not found 2025-05-07T20:11:04.3366334Z libtorch.so => not found 2025-05-07T20:11:04.3366598Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3366884Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3367144Z libcudart.so.12 => not found 2025-05-07T20:11:04.3367419Z libc10.so => not found 2025-05-07T20:11:04.3367923Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fbbefc3a000) 2025-05-07T20:11:04.3368616Z libtorch.so => not found 2025-05-07T20:11:04.3368856Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3369117Z libtorch_cuda.so => not found 
2025-05-07T20:11:04.3369369Z libtorch.so => not found 2025-05-07T20:11:04.3369619Z libc10.so => not found 2025-05-07T20:11:04.3369839Z libc10_cuda.so => not found 2025-05-07T20:11:04.3370102Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3370359Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3370611Z libcudart.so.12 => not found 2025-05-07T20:11:04.3370863Z libc10.so => not found 2025-05-07T20:11:04.3371084Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3371396Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3371642Z libtorch.so => not found 2025-05-07T20:11:04.3371895Z libtorch_cpu.so => not found 2025-05-07T20:11:04.3372180Z libtorch_cuda.so => not found 2025-05-07T20:11:04.3372441Z libtorch.so => not found 2025-05-07T20:11:04.3372593Z 2025-05-07T20:11:04.3372716Z [CHECK] Displaying ELF information: 2025-05-07T20:11:04.3373169Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:04.3373533Z 2025-05-07T20:11:04.3373585Z 2025-05-07T20:11:04.3373738Z Dynamic section at offset 0x148571f8 contains 39 entries: 2025-05-07T20:11:04.3374099Z Tag Type Name/Value 2025-05-07T20:11:04.3374500Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:04.3374982Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:04.3375510Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:04.3376056Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:04.3376525Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:04.3377047Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:04.3377585Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:04.3378066Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:04.3378554Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:04.3379006Z 
0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:04.3379489Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:04.3380044Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:11:04.3380564Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:04.3380950Z 0x000000000000000c (INIT) 0x1c3000 2025-05-07T20:11:04.3381274Z 0x000000000000000d (FINI) 0xf0879c 2025-05-07T20:11:04.3381613Z 0x0000000000000019 (INIT_ARRAY) 0x14856518 2025-05-07T20:11:04.3381943Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:11:04.3382299Z 0x000000000000001a (FINI_ARRAY) 0x148567c0 2025-05-07T20:11:04.3382621Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:04.3382952Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:04.3383328Z 0x000000006ffffef5 (GNU_HASH) 0x4b88 2025-05-07T20:11:04.3383633Z 0x0000000000000005 (STRTAB) 0x1fa30 2025-05-07T20:11:04.3383954Z 0x0000000000000006 (SYMTAB) 0xa208 2025-05-07T20:11:04.3384289Z 0x000000000000000a (STRSZ) 1419969 (bytes) 2025-05-07T20:11:04.3384644Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:04.3384976Z 0x0000000000000003 (PLTGOT) 0x148574a8 2025-05-07T20:11:04.3385328Z 0x0000000000000002 (PLTRELSZ) 18120 (bytes) 2025-05-07T20:11:04.3385668Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:04.3385981Z 0x0000000000000017 (JMPREL) 0x1bded8 2025-05-07T20:11:04.3386315Z 0x0000000000000007 (RELA) 0x17c2e0 2025-05-07T20:11:04.3386640Z 0x0000000000000008 (RELASZ) 269304 (bytes) 2025-05-07T20:11:04.3386989Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:04.3387282Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:04.3387598Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:04.3387920Z 0x000000006ffffffe (VERNEED) 0x17c1a0 2025-05-07T20:11:04.3388243Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:04.3388564Z 0x000000006ffffff0 (VERSYM) 0x17a4f2 2025-05-07T20:11:04.3388870Z 
0x000000006ffffff9 (RELACOUNT) 7406 2025-05-07T20:11:04.3389207Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:04.3389401Z 2025-05-07T20:11:04.3389515Z ################################################################################ 2025-05-07T20:11:04.3389775Z 2025-05-07T20:11:04.3389779Z 2025-05-07T20:11:04.3389976Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:11:04.3441564Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3469087Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3711196Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3749687Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3799660Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3841336Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3881032Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.3910374Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:04.4021531Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4052180Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: 
fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4281226Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4315728Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4377558Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4416919Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4462928Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4501163Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.4905727Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.5274954Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.5469645Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.6416630Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.6454873Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.6544668Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.6859593Z [CHECK] Found 
symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:04.6861503Z ################################################################################ 2025-05-07T20:11:04.6862056Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:04.6862628Z 2025-05-07T20:11:04.6863100Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:04.6863677Z 2025-05-07T20:11:16.4918535Z 2025-05-07T20:11:16.4919872Z fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:16.4921425Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:16.4922343Z 2025-05-07T20:11:16.4922833Z The wheel references external versioned symbols in these 2025-05-07T20:11:16.4924201Z system-provided shared libraries: libgcc_s.so.1 with versions 2025-05-07T20:11:16.4925443Z {'GCC_3.4', 'GCC_3.0'}, libstdc++.so.6 with versions 2025-05-07T20:11:16.4926633Z {'GLIBCXX_3.4.14', 'CXXABI_1.3.8', 'GLIBCXX_3.4', 'CXXABI_1.3.9', 2025-05-07T20:11:16.4927352Z 'GLIBCXX_3.4.19', 'GLIBCXX_3.4.20', 'GLIBCXX_3.4.29', 'CXXABI_1.3.11', 2025-05-07T20:11:16.4927820Z 'GLIBCXX_3.4.18', 'GLIBCXX_3.4.11', 'GLIBCXX_3.4.9', 'CXXABI_1.3', 2025-05-07T20:11:16.4928269Z 'CXXABI_1.3.7', 'GLIBCXX_3.4.21', 'CXXABI_1.3.5', 'CXXABI_1.3.3', 2025-05-07T20:11:16.4928723Z 'GLIBCXX_3.4.15'}, libc.so.6 with versions {'GLIBC_2.14', 2025-05-07T20:11:16.4929455Z 'GLIBC_2.2.5'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:11:16.4929844Z libcudart.so.12 with versions {'libcudart.so.12'} 2025-05-07T20:11:16.4930123Z 2025-05-07T20:11:16.4930342Z This constrains the platform tag to "manylinux_2_34_x86_64". 
In order 2025-05-07T20:11:16.4930843Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:16.4931332Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:16.4931733Z libraries, such as a recent manylinux image. 2025-05-07T20:11:16.5720515Z 2025-05-07T20:11:16.5720597Z 2025-05-07T20:11:16.5721156Z ################################################################################ 2025-05-07T20:11:16.5722280Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:16.5723663Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:16.5724741Z 2025-05-07T20:11:16.5742282Z -rw-r--r--. 1 root root 511M May 7 20:11 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:16.5743686Z 2025-05-07T20:11:16.5744060Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:16.5745402Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:16.5746969Z 2025-05-07T20:11:17.4908977Z 3d8f72b9b95748bf4fc87df11b83791aebde5b6e dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:17.4909580Z 2025-05-07T20:11:17.4909854Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:17.4910232Z 2025-05-07T20:11:19.6720576Z 8b67967ad93c8df5407b62fb843628b91fbd0458321363d951e36ab80c70f872 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:19.6722517Z 2025-05-07T20:11:19.6723295Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:19.6724351Z 2025-05-07T20:11:20.4823322Z cb7d99700a89ff6f26d445ceb8d66573 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:11:20.4824865Z 2025-05-07T20:11:20.4825250Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:20.4935330Z ##[group]Run actions/upload-artifact@v4 
2025-05-07T20:11:20.4935658Z with: 2025-05-07T20:11:20.4935898Z name: fbgemm_default_x86_gcc_py3.10_cu12.6.3.whl 2025-05-07T20:11:20.4936210Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:20.4936482Z if-no-files-found: error 2025-05-07T20:11:20.4936722Z compression-level: 6 2025-05-07T20:11:20.4936979Z overwrite: false 2025-05-07T20:11:20.4937196Z include-hidden-files: false 2025-05-07T20:11:20.4937445Z env: 2025-05-07T20:11:20.4937674Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:20.4937961Z BUILD_ENV: build_binary 2025-05-07T20:11:20.4938217Z BUILD_TARGET: default 2025-05-07T20:11:20.4938434Z BUILD_VARIANT: cuda 2025-05-07T20:11:20.4938673Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:20.4938909Z ##[endgroup] 2025-05-07T20:11:20.4942061Z ##[command]/usr/bin/docker exec 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:20.9224333Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:20.9225111Z Artifact name is valid! 2025-05-07T20:11:20.9225400Z Root directory input is valid! 
2025-05-07T20:11:21.0104887Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:21.8394538Z Uploaded bytes 8388608 2025-05-07T20:11:22.3096424Z Uploaded bytes 16777216 2025-05-07T20:11:22.7865654Z Uploaded bytes 25165824 2025-05-07T20:11:23.2883577Z Uploaded bytes 33554432 2025-05-07T20:11:23.8638288Z Uploaded bytes 41943040 2025-05-07T20:11:24.3885731Z Uploaded bytes 50331648 2025-05-07T20:11:24.8836177Z Uploaded bytes 58720256 2025-05-07T20:11:25.3790108Z Uploaded bytes 67108864 2025-05-07T20:11:25.8621054Z Uploaded bytes 75497472 2025-05-07T20:11:26.3862148Z Uploaded bytes 83886080 2025-05-07T20:11:26.9297128Z Uploaded bytes 92274688 2025-05-07T20:11:27.4479423Z Uploaded bytes 100663296 2025-05-07T20:11:27.9076436Z Uploaded bytes 109051904 2025-05-07T20:11:28.5644873Z Uploaded bytes 117440512 2025-05-07T20:11:28.9936212Z Uploaded bytes 125829120 2025-05-07T20:11:29.4868618Z Uploaded bytes 134217728 2025-05-07T20:11:29.9881213Z Uploaded bytes 142606336 2025-05-07T20:11:30.4879161Z Uploaded bytes 150994944 2025-05-07T20:11:31.0687244Z Uploaded bytes 159383552 2025-05-07T20:11:31.6586020Z Uploaded bytes 167772160 2025-05-07T20:11:32.0859269Z Uploaded bytes 176160768 2025-05-07T20:11:32.6098379Z Uploaded bytes 184549376 2025-05-07T20:11:33.0836113Z Uploaded bytes 192937984 2025-05-07T20:11:33.5806001Z Uploaded bytes 201326592 2025-05-07T20:11:34.0828257Z Uploaded bytes 209715200 2025-05-07T20:11:34.6828738Z Uploaded bytes 218103808 2025-05-07T20:11:35.1351283Z Uploaded bytes 226492416 2025-05-07T20:11:35.6525832Z Uploaded bytes 234881024 2025-05-07T20:11:36.1564249Z Uploaded bytes 243269632 2025-05-07T20:11:36.6535356Z Uploaded bytes 251658240 2025-05-07T20:11:37.1035471Z Uploaded bytes 260046848 2025-05-07T20:11:37.6122606Z Uploaded bytes 268435456 2025-05-07T20:11:38.0788237Z Uploaded bytes 276824064 2025-05-07T20:11:38.6466328Z Uploaded bytes 285212672 2025-05-07T20:11:39.1676219Z Uploaded bytes 293601280 2025-05-07T20:11:39.6975898Z Uploaded 
bytes 301989888 2025-05-07T20:11:40.1420559Z Uploaded bytes 310378496 2025-05-07T20:11:40.7084822Z Uploaded bytes 318767104 2025-05-07T20:11:41.1866548Z Uploaded bytes 327155712 2025-05-07T20:11:41.7267352Z Uploaded bytes 335544320 2025-05-07T20:11:42.2162367Z Uploaded bytes 343932928 2025-05-07T20:11:42.8139205Z Uploaded bytes 352321536 2025-05-07T20:11:43.4763979Z Uploaded bytes 360710144 2025-05-07T20:11:43.9117069Z Uploaded bytes 369098752 2025-05-07T20:11:44.3709317Z Uploaded bytes 377487360 2025-05-07T20:11:44.9297448Z Uploaded bytes 385875968 2025-05-07T20:11:45.3895171Z Uploaded bytes 394264576 2025-05-07T20:11:45.8576264Z Uploaded bytes 402653184 2025-05-07T20:11:46.3246086Z Uploaded bytes 411041792 2025-05-07T20:11:46.8699077Z Uploaded bytes 419430400 2025-05-07T20:11:47.3759573Z Uploaded bytes 427819008 2025-05-07T20:11:47.8158262Z Uploaded bytes 436207616 2025-05-07T20:11:48.2976889Z Uploaded bytes 444596224 2025-05-07T20:11:48.8256328Z Uploaded bytes 452984832 2025-05-07T20:11:49.3558136Z Uploaded bytes 461373440 2025-05-07T20:11:49.7293421Z Uploaded bytes 469762048 2025-05-07T20:11:50.3113462Z Uploaded bytes 478150656 2025-05-07T20:11:50.6952076Z Uploaded bytes 486539264 2025-05-07T20:11:51.1978336Z Uploaded bytes 494927872 2025-05-07T20:11:51.6238450Z Uploaded bytes 503316480 2025-05-07T20:11:52.1516013Z Uploaded bytes 511705088 2025-05-07T20:11:52.7189663Z Uploaded bytes 520093696 2025-05-07T20:11:53.0081676Z Uploaded bytes 524579217 2025-05-07T20:11:53.0285076Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:53.0285801Z SHA256 digest of uploaded artifact zip is 4899568cd6ab68e157d59aee8914434d16aa80441e116c9ae2d744d4dea6d398 2025-05-07T20:11:53.0287621Z Finalizing artifact upload 2025-05-07T20:11:53.1249174Z Artifact fbgemm_default_x86_gcc_py3.10_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081459806 2025-05-07T20:11:53.1250209Z Artifact fbgemm_default_x86_gcc_py3.10_cu12.6.3.whl has been successfully uploaded! 
Final size is 524579217 bytes. Artifact ID is 3081459806 2025-05-07T20:11:53.1263123Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081459806 2025-05-07T20:11:53.1532105Z Post job cleanup. 2025-05-07T20:11:53.1537552Z ##[command]/usr/bin/docker exec 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:53.4502956Z [command]/usr/bin/git version 2025-05-07T20:11:53.4536959Z git version 2.47.1 2025-05-07T20:11:53.4567944Z Copying '/github/home/.gitconfig' to '/__w/_temp/1b1d89fd-2fbc-48ff-adc5-55122cdd5810/.gitconfig' 2025-05-07T20:11:53.4577173Z Temporarily overriding HOME='/__w/_temp/1b1d89fd-2fbc-48ff-adc5-55122cdd5810' before making global git config changes 2025-05-07T20:11:53.4578197Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:53.4588984Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:53.4636685Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:53.4667567Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:53.4948058Z Entering 'external/asmjit' 2025-05-07T20:11:53.4997323Z Entering 'external/composable_kernel' 2025-05-07T20:11:53.5052796Z Entering 'external/cpuinfo' 2025-05-07T20:11:53.5103121Z Entering 'external/cutlass' 2025-05-07T20:11:53.5162306Z Entering 'external/googletest' 2025-05-07T20:11:53.5214064Z Entering 'external/hipify_torch' 2025-05-07T20:11:53.5259594Z Entering 'external/json' 2025-05-07T20:11:53.5326651Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:53.5346960Z http.https://github.com/.extraheader 2025-05-07T20:11:53.5351329Z [command]/usr/bin/git config --local 
--unset-all http.https://github.com/.extraheader 2025-05-07T20:11:53.5377918Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:53.5715701Z Entering 'external/asmjit' 2025-05-07T20:11:53.5749437Z http.https://github.com/.extraheader 2025-05-07T20:11:53.5784506Z Entering 'external/composable_kernel' 2025-05-07T20:11:53.5822917Z http.https://github.com/.extraheader 2025-05-07T20:11:53.5869015Z Entering 'external/cpuinfo' 2025-05-07T20:11:53.5905246Z http.https://github.com/.extraheader 2025-05-07T20:11:53.5940374Z Entering 'external/cutlass' 2025-05-07T20:11:53.5977450Z http.https://github.com/.extraheader 2025-05-07T20:11:53.6021829Z Entering 'external/googletest' 2025-05-07T20:11:53.6061684Z http.https://github.com/.extraheader 2025-05-07T20:11:53.6094946Z Entering 'external/hipify_torch' 2025-05-07T20:11:53.6128053Z http.https://github.com/.extraheader 2025-05-07T20:11:53.6160205Z Entering 'external/json' 2025-05-07T20:11:53.6197208Z http.https://github.com/.extraheader 2025-05-07T20:11:53.6395957Z Stop and remove container: 0d427c0d6c6f41979bf3159e20b740ae_amazonlinux2023_c08d76 2025-05-07T20:11:53.6401292Z ##[command]/usr/bin/docker rm --force 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T20:11:54.4524525Z 12a11cea79f2a56e791870b9b2b1e53d02b52a3ff76d9efaab3e95260cbab6cf 2025-05-07T20:11:54.4562709Z Remove container network: github_network_94fc62e5ee044bf697e58aee19d01a64 2025-05-07T20:11:54.4567248Z ##[command]/usr/bin/docker network rm github_network_94fc62e5ee044bf697e58aee19d01a64 2025-05-07T20:11:55.3890901Z github_network_94fc62e5ee044bf697e58aee19d01a64 2025-05-07T20:11:55.3934145Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:55.3955939Z ##[group]Run 
'/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:55.3962371Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:55.3962928Z ##[endgroup] 2025-05-07T20:11:55.4073032Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:12:05.3928791Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:12:21.3430509Z Cleaning up orphan processes